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Diplomatic service 


Government science advisers are unlikely to be specialists on the subject of a crisis, but they are key 
to bringing together relevant experts and disseminating the information clearly and accurately. 


the scientific advice those politicians receive? Academic wisdom on 

tap is becoming more common in policy-making. Almost 60 years 
after the launch of the first Soviet satellite Sputnik startled US President 
Dwight Eisenhower into appointing James Killian as his nation’s first 
scientific adviser, Killian’s modern equivalents from around the world 
will meet in New Zealand this week for their own conference. They will, 
presumably, keep their mobile phones switched on. 

Researchers and politicians want different things from these people. 
To working scientists, an adviser is one of their own who has the ear of 
government and has the chance to talk up the importance of science and 
to protect national investment in research. Politicians, however, want 
the stamp of peer-reviewed approval for their policies. Sometimes, they 
even want those policies to be based on the peer-reviewed evidence. 
Incidentally, none of these functions was a priority for Eisenhower, who 
wanted to harness domestic science to improve US economic and mili- 
tary prospects (see Nature 488, 559; 2012). 

Science advice to governments, either through the formal mechanism 
popular in the United States, Britain and some other European coun- 
tries, or through more ad hoc systems, often takes on an emergency- 
response and crisis-management role. This topic is a special focus for 
the New Zealand meeting. On page 360 of this issue, Nature highlights 
and analyses three examples in which scientists were at the centre of 
a national crisis. A volcanic eruption in Iceland in 2010 grounded 
flights across Europe and saw the crude output of atmospheric models 
discussed on the front pages of tabloid newspapers. And just as that 
emergency started to ease, experts in the United States had to wrestle 


I: all political careers end in failure, then what does that say about 


with the political, environmental and economic fallout of the fatal 
explosion of the BP Deepwater Horizon drilling rig and the massive 
oil leak into the surrounding Gulf of Mexico. Then, a year later, dozens 
of people in Germany were killed and hundreds more hospitalized in 
Europe’ worst recorded outbreak of Escherichia coli poisoning. 

Science and science advice received mixed reviews in each of these 
incidents. The research got there in the end: the oil spill was contained 
and quantified, and the source of the E. coli identified. Should another 
volcanic eruption burst through the Iceland ice — and one was threat- 
ening to as Nature went to press — regulators now have more refined 
models with which to work out the likely impact. But, too often, the 
scientific response to a problem is overtaken by events. 

A week isa long time in politics, but it is an eternity ina crisis. Britain's 
response during the 2010 eruption offers a model that could be emu- 
lated elsewhere. The country’s chief science adviser was able to tap into 
a previously developed crisis-response strategy and convene a broad 
panel of scientific and technical experts that included people both inside 
and outside government. Events still moved faster than the panel did, 
and key decisions were taken before the committee even met, but once 
the scientists gathered, they provided important advice that helped the 
government to decide how to respond to the developing emergency. 

Communication, of course, is key. Peter Gluckman, New Zealand’s 
chief science adviser, is hosting this week’s conference and wants it to 
spawn a network of science advisers who can learn from past crises and 
call on each other for help. “We're a small country. Ifsomething happens 
which is not in our area it would be nice to ring up someone in Europe 
who has the expertise,’ he says. Keep those phones on, folks. = 


People power 


Climate models must consider how humans are 
responding to a warming world. 


as the cosmologist Stephen Hawking noted: “They are not much 
use in predicting human behaviour because there are far too 
many equations to solve.” 

The motives, needs and desires that drive human action have long 
resisted rational analysis. From the volatility of the stock market to 
fads and fashions that flare brightly and then vanish, the ability of 
individuals to act unpredictably has undermined attempts to model 
their behaviour with any level of precision. 

The science-fiction writer Isaac Asimov had the right idea. If one 
considers a sufficiently large population of people, he wrote, then 


Pp hysics and mathematics can tell us how the Universe began, but 


just as the mass movement of a gas can be inferred through simple 
calculations — whatever the individual molecules might do — so too 
can the future actions of a large population. 

Asimov called his fictional science of predicting people's behaviour 
psychohistory. He used it as a central plank of his classic Foundation 
series of books. The predictions of psychohistory were more than a 
model, they were a set of instructions for how future societies must 
respond to a predictable crisis they helped to create. 

Ina Comment on page 365, Paul I. Palmer and Matthew J. Smith call 
for human adaptation to climate change to be modelled to help avert 
a real-life predictable crisis. Existing models of the planet’s changing 
climate are insufficient, they argue, because they leave out the people. 
Omitting human behaviour from these mathematical studies, they 
write, is like “designing a bridge without accounting for traffic”. 

Societies will be different in a warmer world, they point out, and 
we should understand how this will unfold. It is, in essence, another 
feedback in the climate system, and one that should be quantified and 
accounted for. Perhaps another seven billion equations will need to 
be added to the mix. = 
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WORLD VIEW jecnisicor sen 


United States they remain unregulated, but 8 August marked 

the end of the public comment period on a proposal by the US 
Food and Drug Administration (FDA) that would bring e-cigarettes 
under its authority. Now the FDA must act. 

That action must be based on science. The FDA’s background 
document explains: “We do not currently have sufficient data about 
these products to determine what effects e-cigarettes have on the public 
health. Only if e-cigarettes are deemed a tobacco product, as under 
the proposal, can the agency begin to collect the data that can permit 
it to “account for the net public health impacts” On the basis of this 
assessment, the FDA can decide how to craft regulations to protect and 
improve public health. 

When it comes to the direct impact of these 
devices on public health, the tenor of the FDA’s 
language, and of comments from many relevant 
organizations, is precautionary. The American 
Cancer Society (ACS) says: “Until electronic ciga- 
rettes are scientifically proven to be safe and effec- 
tive, ACS will support the regulation of e-cigarettes 
and laws that treat them like all other tobacco 
products.’ The Forum of International Respira- 
tory Societies goes further, saying that the risks of 
e-cigarettes have not been adequately studied and 
as a precaution, such devices “should be restricted 
or banned until more information about their 
safety is available”. A group of 29 state attorneys 
general sent a 33-page letter to the FDA arguing 
that “e-cigarettes contain and deliver nicotine — a 
well-recognized addictive chemical — in amounts 
comparable to traditional cigarettes. Accordingly, 
e-cigarettes should be assumed to be both harmful and addictive” 

Bollocks. Let’s do a thought experiment. Imagine that every smoker 
in the United States changed to e-cigarettes. What would be the conse- 
quences? An e-cigarette, in essence, allows you to be addicted to nicotine 
(which is not carcinogenic), and to enjoy the tactile pleasures of smok- 
ing without exposing yourself to the 60 or more cancer-causing agents, 
or to most of the hundreds of other toxic chemicals, that are released 
from burning tobacco. If all US smokers ‘vaped’ (the verb coined to 
distinguish inhaling e-cigarette vapours from inhaling tobacco smoke) 
instead of smoked, about 480,000 deaths might eventually be avoided 
per year. We may never approach such a full transition, but the point is 
that the causal relationship between inhaling tobacco smoke and dying 
from cancer and other diseases is very robust. 

How many people would e-cigarettes kill 


Peess cigarettes are growing rapidly in popularity. In the 


instead? Evidence of the effects of widespread DNATURE.COM 
vaping is limited and contradictory.Some stud- _ Discuss this article 
ies show that e-cigarettes can wean people from __ online at: 


smoking, others suggest that the effectis, atbest, _go.nlafure.com/fyghwr 


MAKE SMOKING 
UNCOOL, 


EXPENSIVE 
AND STUPID, 


AND VAPING 


COOL AND 
SMART. 


Allow use of electronic 
cigarettes to assess risk 


Monitoring the outcomes of incentivized e-cigarette use, not endless research, 
will be the key to sensible regulation, says Daniel Sarewitz. 


modest and short-lived. Still others suggest that anything that makes 
vaping seem desirable might coax non-smokers to smoke. 

Unanticipated potential risks are being discovered and debated. 
Studies have shown that e-cigarette vapour includes various fine par- 
ticulates, some of which are toxic; other research indicates that expo- 
sure levels are too low to be dangerous. More science will expand the 
evidence and the potential risks, but as complexities and questions 
emerge it will also increase, rather than reduce, the contradictions 
and uncertainties. The extraordinary difficulty of demonstrating the 
benefits of salt reduction, mammography, or various diets, for exam- 
ple, ought to serve as cautionary lessons. Given the millions who will 
die from smoking in the near future, does it make sense to spend years 
discovering, characterizing and debating ancillary risks of vaping that 
are almost certainly less serious than the known 
risks of smoking as a precondition for responsi- 
ble policy-making? This is precaution? 

E-cigarettes must be regulated. Ingredi- 
ents should be labelled. No responsible voices 
would allow them to be sold to children. Such 
requirements are already in force in the Euro- 
pean Union. The more important question is 
whether regulation should be driven by the risks 
of e-cigarette use, or by the risks of not using 
them. The former promises endless research, 
uncertainty, and debate; the latter may offer a 
technological short-cut to solving one of the 
world’s most serious public-health problems. 

No one knows to what extent vaping will 
displace smoking, but the sure way never to find 
out is to make policies hostage to endless stud- 
ies on population-wide risks. Instead we should 
test the effectiveness of policies, perhaps in limited jurisdictions, that 
encourage vaping among smokers and potential smokers. Keep the tax 
burden, and thus cost, low relative to cigarettes. Allow advertisements. 
With George Clooney. Continue to allow vaping in bars, restaurants and 
workplaces. Make smoking uncool, expensive and stupid, and vaping 
cool and smart. If people must get addicted to something, let them get 
addicted to a thing that does not give them or their families cancer. And 
carefully monitor the outcomes. 

As research for this column, I tried vaping. The taste was perfectly 
pleasant, and my office colleagues said that the white-ish clouds I 
exhaled had no smell. 1am not a smoker, so I cannot rate how the overall 
experience compares to cigarettes, and I have not vaped enough to rec- 
ognize any physiological response. But then again, Iam already addicted 
to the pleasures of caffeine, and one costly addiction is enough. m 


Daniel Sarewitz is co-director of the Consortium for Science, Policy and 
Outcomes at Arizona State University, and is based in Washington DC. 
e-mail: daniel.sarewitz@asu.edu 
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RESEARCH HIGHLIGHTS 


BIOTECHNOLOGY 


Yeast turned into 
opioid-makers 


Baker's yeast can be made to 
produce morphine, codeine 
and other pain medicines at 
high levels. 

Opioid production is 
subject to the vagaries of the 
opium poppy supply chain, 
so Christina Smolke and 
her colleagues at Stanford 
University in California 
wanted to find alternative 
ways of making the drugs. 
They engineered the yeast 
(Saccharomyces cerevisiae) to 
express genes from the poppy 
(Papaver somniferum) and 
the bacterium Pseudomonas 
putida M10, then cultured 
it with thebaine, an opioid 
intermediary molecule 
extracted from the poppy. The 
yeast synthesized high enough 
levels of several natural and 
semisynthetic opioids to 
make the method potentially 
useful to the pharmaceutical 
industry, the authors say. 

The next step, they add, 
is to engineer yeast to make 
these painkillers from simple 
sugars, eliminating the need 
for poppies altogether. 

Nature Chem. Biol. http://dx.doi. 
org/10.1038/nchembio.1613 
(2014) 


ASTRONOMY 


Collision history 
written in rock 


Meteorites recovered in 
California have yielded 
details about their collision- 
filled journey from the Solar 
System's asteroid belt. 
The fragments 
(pictured) originated 
from a meteoroid 
whose fiery 
descent lit up the 
night sky over 
San Francisco 
in 2012. Peter 


Selections from the 
scientific literature 


MARINE ECOLOGY 


Sick reefs repel young coral 


Coral larvae actively avoid the smell of 
degraded marine ecosystems — potentially 
impeding efforts to rebuild damaged reefs. 
Mark Hay at the Georgia Institute of 
Technology in Atlanta and his team studied 
coral reefs near Fiji (pictured). They focused 
on the behaviour of coral larvae in water 
from healthy, protected reefs and from reefs 
overgrown with seaweed. 
Given the choice between the two streams of 


Jenniskens of NASAs Ames 
Research Center in Moffett 
Field, California, and his 
colleagues analysed the six 
fragments and concluded that 
the meteoroid is one of several 
thought to have come from a 
parent asteroid that shattered 
in a collision 470 million years 
ago. This meteoroid broke 

apart again in collisions 
9 million and 

4 million years ago. 


100,000 years. 


The team also detected 
traces of carbon-rich organic 
molecules that somehow 
managed to survive. 

Meteorit. Planet. Sci. 49, 
1388-1425 (2014) 


Liquid layer for 
lung defence 


water, Acropora coral larvae spent more than 
85% of their time in water from the protected 
areas. The organisms also showed a similar 
preference for clean water containing chemical 
cues from a variety of corals, whereas seaweed 
cues decreased this preference. 

Conservationists will probably need to boost 
coral-attracting chemicals when rebuilding 
seaweed-choked reefs, the authors suggest. 
Science 345, 892-897 (2014) 


Saskatchewan in Saskatoon, 
Canada, developed an imaging 
technique to visualize the 
depth of the protective layer 
that lines the airways. The 
researchers found that when 
the bacterium Pseudomonas 
aeruginosa was introduced 
into isolated pig tracheas, the 
airways secreted more of this 
film than unexposed tracheas. 

The results could explain 
why patients with cystic 


Measurements 

of accumulated 

radiation in the 
rock suggest that 
it suffered a final 
collision within the past 
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Inhaling bacteria increases 
the production of mucus 
and liquid that trap and kill 
microbes in the airways. 

A team led by Juan 
Ianowski at the University of 


fibrosis, who do not produce 
this layer properly, are prone 
to respiratory infections and 
inflammation. Boosting this 
previously unknown part of 
the immune response in these 
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patients could improve their 
lung function, the authors say. 
Proc. Natl Acad. Sci. USA 
http://doi.org/vbc (2014) 


| ___NEUROSCIENCE 
Light signals boost 
stroke recovery 


Mice that have had a stroke 
regain motor function after 
specific brain neurons are 
stimulated by light. 

Using a technique called 
optogenetics, Michelle 
Cheng, Gary Steinberg and 
their colleagues at Stanford 
University in California 
studied mice that were 
genetically engineered to 
express a light-sensitive 
protein in a group of neurons 
in the brain’s motor cortex. 

The team induced a stroke 
on one side of this area and 
then activated the engineered 
neurons by switching ona 
small, implanted light probe. 
This stimulation increased 
blood flow and the levels of 
various growth factors in the 
opposite, uninjured cortex. 
Moreover, these animals 
gained more weight and 
walked faster and farther than 
unstimulated stroke mice. 

Further study of the 
mechanisms of this recovery 
could lead to new drug targets 
or therapy techniques for 
stroke, the authors say. 

Proc. Natl Acad. Sci. USA 
http://doi.org/vbj (2014) 


Earth’s lakes 
added up 


The world is home to 
117 million lakes, according to 
a satellite-based survey. 
Lakes are important to 
the planet’s carbon cycle, but 
researchers have struggled 
to estimate the number that 
exist, in part because small 
lakes have been difficult to 
count. A team led by Charles 
Verpoorter of the University 
of Lille Nord in France used 
high-resolution imagery 
from the Landsat satellite to 
catalogue all lakes larger than 
2,000 square metres. 


Unlike earlier estimates, 
the total count is dominated 
by large and medium- 
sized lakes, not small ones. 
Together, the lakes cover 
about 5 million square 
kilometres, or 3.7% of Earth’s 
land not covered by ice. 
Geophys. Res. Lett. http://doi. 
org/t9v (2014) 


VIROLOGY 


Polio killed the 
vaccinated 


The poliovirus strain that 
caused an outbreak in the 
Republic of the Congo in 2010 
is able to resist the immune 
responses generated by a 
commonly used vaccine. The 
finding could explain why the 
outbreak, which killed nearly 
half of the 445 people infected, 
was So severe. 

Christian Drosten at the 
University of Bonn Medical 
Centre in Germany and 
his colleagues analysed the 
virus strain responsible for 
the outbreak. They founda 
combination of two mutations, 
both in the proteins of the 
strain’s ‘coat, which make it 
harder for certain antibodies 
to stick to the virus. People 
who died in this outbreak had 
been vaccinated in the past, 
but people who were freshly 
re-vaccinated could fight off 
the virus. 

The authors warn that other 
resistant strains could emerge, 
even as the world is close to 
eradicating the disease. 

Proc. Natl Acad. Sci. USA 
http://doi.org/vbg (2014) 


Algal boom and 
bust tracked 


Viruses that infect and kill 
algae could influence the 
ocean carbon cycle. 

Ilan Koren and Assaf Vardi 
at the Weizmann Institute of 
Science in Rehovot, Israel, and 
their colleagues used satellite 
imagery to measure the 
atmospheric carbon absorbed 
bya roughly 1,000-square- 
kilometre algal bloom during 
its 25-day life cycle in the 


RESEARCH HIGHLIGHTS BiiSaiaa¢ 


SOCIAL SELECTIO 


Popular articles 
on social media 


The cost of misconduct 


A report detailing the supposedly meagre costs of scientific 
misconduct has set off an online discussion about the real toll 
of shoddy science. Using the records of the US government’s 
Office of Research Integrity, researchers led by Ferric Fang 
at the University of Washington in Seattle collected studies 
funded by the National Institutes of Health (NIH) that had 
been retracted because of misconduct from 1992 to 2012. 
All told, the NIH had spent about US$58 million on these 
projects, less than 1% of its total budget over this time. On 
Google+, mathematician Joerg Fliege at the University 

of Southampton, UK, did what he does best: the maths. 
“Multiply this figure with 100 if you believe that only 1% of 
misconducts get detected,” he posted. “Still not much of an 
expense, in the overall scheme of things.” 


eLife 3, e€02956 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


North Atlantic. They found 
that the algae converted 
around 22,000 tonnes of 
atmospheric carbon into 
organic carbon — about 

as much asa rainforest of 
equivalent size — before 
viruses caused the bloom to 
collapse. 

Two-thirds of this carbon 
was released back into the 
atmosphere within a week 
of the bloom’s collapse. 

The rest could have been 
transported deep into the 
ocean, as infected algae sank 
to the ocean floor, the authors 
suggest. 

Curr. Biol. http://doi.org/vbx 
(2014) 


PLANT SCIENCES 


Plants drink 
mineral water 


Thirsty plants can extract 
water from the crystalline 
structure of gypsum, a rock- 
forming mineral found in soil 
on Earth and Mars. 

Some plants grow on 
gypsum outcrops and 
remain active even during 
dry summer months, 
despite having shallow 


NATURE.COM 
For more on 
popular papers: 
go.nature.com/tuacod 


roots that cannot reach the 


water table. Sara Palacio of 
the Pyrenean Institute of 
Ecology in Jaca, Spain, and 
her colleagues compared the 
isotopic composition of sap 
from one such plant, called 
Helianthemum squamatum 
(pictured), with gypsum 
crystallization water and 
water found free in the soil. 
The team found that up to 
90% of the plant’s summer 
water supply came from 
gypsum. 

The study has implications 
for the search for life in 
extreme environments on this 
planet and others. 

Nature Commun 5, 4660 (2014) 


© NATURE.COM 

For the latest research published by 
Nature visit: 
www.nature.com/latestresearch 
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POLICY 


Science adviser 

The post of chief scientific 
adviser to the European 
Commission came under 
attack again on 19 August 
when nearly two dozen non- 
governmental organizations 
called on the incoming 
president of the commission 
to scrap the job. In an open 
letter to Jean-Claude Juncker, 
groups including Friends of 
the Earth Europe added their 
weight to an existing campaign 
to abolish the science role. The 
letter argues that the position 
“concentrates too much 
influence in one person”. The 
mandate of the commission’s 
current chief scientific adviser, 
Anne Glover, is scheduled to 
end later this year. 


Trial enrolment 

The US Food and Drug 
Administration published a 
plan on 20 August to ensure 
that women and racial and 
ethnic minority groups 

are adequately included in 
clinical trials. The action 

plan aims to collect data on 
population subgroups in 
trials, to encourage and enable 
more women and minority 
groups to enrol, and to make 
the demographic make-up of 
trials more transparent. It will 
be implemented in stages over 
the next five years. The agency 
also published guidance 

on evaluating how medical 
devices might function 
differently in men and women. 


EVENTS 


Icelandic volcano 
After a reported eruption 

on 23 August turned out to 

bea false alarm, Iceland’s 
Bardarbunga volcano 
continued to rumble deep 
underground. Since seismic 
activity began on 16 August, 
thousands of earthquakes have 
shaken the ground north and 


The news in brief 
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Satellites pinpoint ice loss 


The massive ice sheets in Antarctica (pictured) 
and Greenland are together shrinking at a rate 
of 500 cubic kilometres per year — the fastest 
pace since satellite altimetry began 20 years 
ago. The European Space Agency’s CryoSat-2 
probe collected precise elevation data for both 
ice sheets (Antarctica’s shown on right). The 


east of the volcano. They show 
where magma is squirting up 
from below and forming a 
freshly cooled sheet of rock, 

or dyke, a few kilometres 

deep. As of 25 August, the 
dyke was thought to be nearly 
35 kilometres long and to 
contain 300 million cubic 
metres of magma. Aviation 
authorities remained on alert 
in case an eruption spewed ash 
into the air. See go.nature.com/ 
iidaau for more. 


Ebola in fifth nation 


The Democratic Republic 

of Congo became the fifth 
African nation to confirm 
cases of Ebola, on 24 August. 
On the same day, a Senegalese 
epidemiologist was reported 
as the first person working 
for the World Health 
Organization (WHO) to 
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contract the disease. On 

22 August, the WHO said 
that the official count of 
2,615 cases and 1,427 deaths 
probably underestimates the 
true size of the epidemic. It 
blamed community resistance 
to reporting cases anda 

lack of adequate treatment 
facilities. The agency thinks 
that the epidemic could last 
for another 9 months. See 
page 355 for more. 


Wonhky orbit 

The latest pair of satellites 

in Europe’s Galileo global 
positioning system went into 
the wrong orbit after a botched 
launch from French Guiana 
on 22 August. The satellites 
were supposed to adopt a 
circular orbit, but are now 
travelling on an elliptical path 
and at a lower altitude than 


2,000 
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greatest ice loss (shown on left in red) between 
2011 and 2014 occurred at Pine Island glacier 
in western Antarctica and at Jakobshavn glacier 
in Greenland. The findings were reported 

on 20 August by researchers from the Alfred 
Wegener Institute in Potsdam, Germany 

(V. Helm et al. Cryosphere 8, 1539-1559; 2014). 


planned. Arianespace, the 
French company responsible 
for the launch, said that it is 
investigating whether the 
problem was due to the Soyuz 
launch rocket. The satellites 
are the fifth and sixth out 

ofa total constellation of 30 
Galileo spacecraft. 


Drone lawsuit 

An association of 188 US 
universities is taking the 
government to court in an 
attempt to overturn rules 
that restrict researchers’ use 
of drones. On 22 August, the 
Council on Governmental 
Relations in Washington DC 
filed a complaint in the 
federal appeals court, arguing 
that restrictions on the 
‘commercial’ use of drones — 
imposed by the US Federal 
Aviation Administration 


HELM ET AL./THE CRYOSPHERE 
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SOURCE: ASIAN DEVELOPMENT BANK 


(FAA) in June — unfairly 
hamper the application of 
drones in science (see also 
Nature 512, 231; 2014). The 
FAA says that commercial use 
includes research activities at 
private universities. 


PEOPLE 
Nuclear spy 


A former employee of the Los 
Alamos National Laboratory 
in New Mexico was convicted 
on 20 August of conspiring 
with her husband to sell US 
nuclear secrets to Venezuela. 
Marjorie Roxby Mascheroni, 
who was a writer and editor 
at the lab between 1981 and 
2010, was handed a 366-day 
prison sentence. The charges 
arose from the interactions 
of her husband, formerly a 
physicist at the lab, with an 
undercover FBI agent posing 
as a Venezuelan government 
official. Pedro Leonardo 
Mascheroni, who awaits 
sentencing, claims that his 
dealings were part of an 
attempt to fund construction 
ofa laser for nuclear fusion. 


Minister dismissed 


Iran's parliament voted 

on 20 August to dismiss 

the country’s science and 
technology minister, Reza 
Faraji-Dana (pictured), 

for attempting to liberalize 
universities and allegedly 
politicizing Iran’s academic 
environment. Faraji-Dana, an 


TREND WATCH 


Climate change will cause the 
collective economy of the six 
countries of South Asia to lose an 
average 1.8% of its annual gross 
domestic product by 2050, rising 
to 8.8% by 2100. The data, from 
the Asian Development Bank and 
published on 19 August, show 
that the Maldives and Nepal will 
be hardest hit (see map). The 
forecast assumes a 4.6 °C rise 

in global temperatures. South 
Asia will need to spend at least 
US$73 billion a year between 
now and 2100 to adapt to climate 
change, the report says. 


electrical engineer and former 
chancellor of the University of 
Tehran, had joined the cabinet 
of reformist president Hassan 
Rouhani last year. He upset 
hardline parliamentarians 
when he tried to allow activist 
students and professors back 
onto campus after they had 
been banned by Rouhani’s 
conservative predecessor 
Mahmoud Ahmadinejad 
following anti-government 
unrest in 2009. 


| RESEARCH 
MERS model 


Marmosets are the best 
animal model for Middle 
Eastern Respiratory Syndrome 
(MERS). Research on the 
MERS coronavirus was 
hindered by the lack of an 
animal model that showed the 
same respiratory symptoms 

as humans when infected 
with the virus. In two studies 
published on 21 August, 
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Predicted annual 
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countries by 2100 
caused by crop 
failures, coastal 
floods and 
more-widespread 
disease. 
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researchers from the US 
National Institute of Allergy 
and Infectious Diseases 
showed that the virus could 
infect marmosets, and that 
the animals’ symptoms mimic 
the severe pneumonia seen in 
humans (D. Falzarano et al. 
PLoS Pathog. 10, e1004250 
(2014); N. van Doremalen 
etal. J. Virol. 88, 9220-9232 
(2014)). 


African farming 

The Alliance for a Green 
Revolution in Africa (AGRA), 
a science-based non- 
governmental organization in 
Nairobi has helped 1.7 million 
African farmers to rejuvenate 
1.6 million hectares of land, 
and to double, or even triple, 
crop yields over the past five 
years through its Soil Health 
Programme. AGRA reported 
the results on 22 August. The 
programme tests and teaches 
techniques to improve soil 
fertility and makes chemical 
fertilizers more affordable 

for poor farmers. Depleted 
soils cost African farmers 
US$4 billion a year in lost 
productivity. 


Stem-cell go-ahead 
Regulators in the United 
States have cleared the 

way for a clinical trial ofa 
prospective stem-cell-derived 
treatment for type 1 diabetes. 
On 19 August, ViaCyte of 

San Diego, California, said 
that the Food and Drug 
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SEVEN DAYS | THIS WEEK | 


2-4 SEPTEMBER 
Agricultural scientists 
meet at the Africa Green 
Revolution Forum 

in Addis Ababa to 
discuss how to boost 
agricultural productivity 
in the developing world. 
www.agrforum.com 


5-9 SEPTEMBER 
Researchers meet 

at the Interscience 
Conference on 
Antimicrobial Agents 
and Chemotherapy 
in Washington DC 

to discuss infectious- 
disease control and 
prevention on a global 
scale. 
go.nature.com/8cjcaa 


Administration has given it 
permission for a phase I/II 
clinical trial of a product 
that consists of pancreatic 
precursor cells packaged 
ina mesh pouch. ViaCyte'’s 
treatment could become 

one of only a handful of 
human embryonic-stem- 
cell-derived products to be 
trialled in people, and will 
be an important test for the 
effectiveness of California's 
state stem-cell institute, which 
provided funding to develop 
the product. 


| BUSINESS 
Pharma takeover 


Swiss pharmaceutical 

giant Roche announced 

on 24 August that it will be 
purchasing the biotechnology 
firm InterMune for US 

$8.3 billion. Headquartered 
in Brisbane, California, 
InterMune owns a drug called 
pirfenidone for treatment 

of idiopathic pulmonary 
fibrosis. The drug is sold in 
Europe and Canada, but has 
yet to gain approval in the 
United States. 


> NATURE.COM 
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INFECTIOUS DISEASES 


A health-care worker dons protective gear before entering an Ebola treatment centre near Monrovia, Liberia. 


World struggles to stop Ebola 


Greater international assistance is needed to quell the epidemic, say health officials. 


BY ERIKA CHECK HAYDEN 


an Kelly felt as if he were entering a war 
D zone when he arrived at Connaught 

Hospital on 19 August. His friend 
Modupeh Cole, the physician in charge of the 
Ebola isolation ward at the hospital in Freetown, 
Sierra Leone, had died six days earlier. Marta 
Lado, a doctor from Spain, was caring for the 
ward's ten patients. “She was mopping the floors 
herself? says Kelly, an infectious-disease physi- 
cian and co-founder of the Wellbody Alliance, 


a non-profit health-care organization in Sierra 
Leone. The international aid group Médecins 
Sans Frontieres has called the world’s response 
to the Ebola epidemic in West Africa “danger- 
ously inadequate”. As Kelly travelled around 
Freetown, noticing closed clinics and health- 
care workers without adequate protective train- 
ing and equipment, he had to agree. 

“We're lacking the kind of passionate human- 
itarian response that we've seen for disasters like 
the Haiti earthquake or the Haiyan typhoon” 
in the Philippines, says Kelly, who is raising 


money through the University of California, San 
Francisco, to teach infection-control practices 
to health-care workers fighting the outbreak. 
More than 240 health-care workers have con- 
tracted Ebola in the current episode, including 
a Senegalese epidemiologist who is the first per- 
son deployed by the World Health Organization 
(WHO) to become infected with the disease. 
“We're around rock bottom,’ says Kelly. 

Weeks after the WHO declared the Ebola 
outbreak a public-health emergency of inter- 
national concern and the World Bank 
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> pledged to provide up to US$200 million in 
aid, enough resources to end the crisis have yet 
to materialize. The WHO reports that the Ebola 
outbreak has so far claimed 1,427 lives in four 
West African nations — more than all previous 
recorded outbreaks combined, and probably an 
underestimate (see ‘Out of control’). It says that 
the epidemic will last for another 6-9 months, 
and the scientists and doctors who are fighting 
it agree. “I can comfortably say that we will be in 
this for the next 4 to 9 months, not including the 
follow-up that will be required to deal with the 
post-traumatic effects,” says virologist Joseph 
Fair, a special adviser to the ministry of health 
in Sierra Leone. 

To end the crisis, developed countries and 
aid groups need to send more people with 
expertise in treating infectious diseases, 
public-health experts say. In Monrovia, 
Liberia's capital, only three Ebola treatment 
facilities were operating as of 22 August; the 
WHO says that, in the coming weeks, facilities 
will be needed to treat another 500 patients. 
The US Centers for Disease Control and Pre- 
vention in Atlanta, Georgia, has sent 60 peo- 
ple to West Africa to help with surveillance, 
disease tracking and education, but they do 
not treat patients. Health-care workers and 
experts are badly needed from other Western 
governments, says Sophie Delaunay, execu- 
tive director of Médecins Sans Frontiéres in 
New York. Many institutions and aid groups 
are even removing doctors from the region 


OUT OF CONTROL 


out of concern for the physicians’ safety. 
Affected countries must also do a better 
job of dealing with patients and educating the 
public about the disease, say doctors and sci- 
entists. Insensitivity on the part of government 
officials has sparked uprisings, including a riot 
in a quarantined neighbourhood of Monrovia 
on 20 August, says Bailor Barrie, co-founder of 


the Wellbody Alliance. 

Officials are likely to make more progress 
fighting Ebola’s spread 
if they explain their “We’reat the 
actions and show point where 
concern for those almost anything 
affected, he adds, cit- wedototurn 
ing his own experi- this around will 
ence ina villagein make things 
Kono, Sierra Leone, better.” 


earlier this month. 

After a villager there tested positive for Ebola, 
police put two houses under quarantine — and 
six people fled. Barrie, along with community 
leaders and public-health workers, then visited 
the remaining 30 residents to explain the rea- 
son for the quarantine and to answer questions. 
One man said that he needed someone to go and 
buy cigarettes, and another needed palm wine; 
health officials tasked a worker with making 
these daily deliveries. The inhabitants stayed in 
quarantine for the full 21-day isolation period, 
even as three of them developed Ebola-like 
symptoms. Diagnostic tests were negative, and. 
no one in the houses came down with the virus. 


The death toll from Ebola virus in West Africa continues to rise. Infectious-disease experts say that more 


health-care workers are needed to contain the outbreak. 
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8 August: The WHO declares outbreak a public- 
health emergency of international concern. 


4 August: The World Bank pledges up to 
US$200 million to contain outbreak. 


20 June: Médecins Sans 
Frontiéres says outbreak 
is "totally out of control". 
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“When you don’t engage people, and just tell 
them what is right without listening to them, 
they will not comply with your orders,’ says 
Barrie. “That's why the initial response in some 
countries was full of denial, fear and terror — 
everyone was afraid.” 

Fair says that governments also need 
to undertake widespread, country-wide 
campaigns, “blasting all forms of media — 
billboards, radio jingles, T-shirts, pens... basi- 
cally, inundating the public with so much 
information that they can’t not hear it and 
remember it”. 

But, ultimately, the most effective tactic to 
fight Ebola’s spread is to train West Africans 
in basic public-health measures, says Daniel 
Bausch, a physician at Tulane University in 
New Orleans, Louisiana, who has treated the 
disease in Guinea and Sierra Leone. “I’m not 
naive enough to think it’s an easy thing to do, 
but it’s the only thing that’s actually going to 
work,” he says. 

The experience of groups such as Last Mile 
Health, a non-profit organization that operates 
in Boston, Massachusetts, and in Monrovia, 
shows that such training can work. Last Mile 
Health has deployed 150 community health 
workers to educate rural Liberians about Ebola. 
It has also worked to provide basic materials to 
care for patients. Rajesh Panjabi, the organiza- 
tion's chief executive, says that in mid-August, 
nurses at the Martha Tubman Memorial Hos- 
pital in Zwedru — which serves 130,000 people 
in rural eastern Liberia — were preparing to 
strike over a lack of protective equipment. Last 
Mile Health provided gloves, masks, gowns and 
training in how to use them, and the nurses con- 
tinued working. (Liberia's ministry of health 
had estimated earlier this month that the coun- 
try needs more than 451,360 pairs of medical 
gloves.) “This is the type of impact you can have 
through very basic activities, but it takes sus- 
tained response and investment,’ says Panjabi. 

There are other hopeful signs. At Connaught 
Hospital, nurses from countries including Sierra 
Leone have joined Lado, the physician leading 
the clinical effort of the King’s Sierra Leone 
Partnership, an initiative of the King’s Centre 
for Global Health in London. Two doctors and 
another nurse from the partnership will arrive 
at Connaught in the coming weeks. “We're at the 
point where almost anything we do to turn this 
thing around will make things better,’ says Kelly. 
“It’s got to be one hospital at a time? m 
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Eucalyptus plantations near Sao Paulo in Brazil. 


BIOTECHNOLOGY 


Brazil considers 
transgenic trees 


Genetically modified eucalyptus could be a global test case. 


BY HEIDI LEDFORD 


iewed from above, Brazil’s orderly 

\ | eucalyptus plantations offer a stark con- 

trast to the hurly-burly of surrounding 

native forests. The trees, lined up like regiments 

of soldiers on 3.5 million hectares around the 

country, have been bred over decades to grow 
quickly. 

On 4 September, a public hearing will 
consider bringing an even more vigorous recruit 
into the ranks: genetically engineered eucalyp- 
tus that produces around 20% more wood than 
conventional trees and is ready for harvest in 
five and a half years instead of seven. Brazilian 
regulators are evaluating the trees for commer- 
cial release; a decision could come as early as the 
end of this year. 

Researchers, businesses and activists are 
watching closely. Eucalyptus (Eucalyptus spp.) 
— native to Australia — is grown on about 
20 million hectares throughout the tropics 
and subtropics, and approval of the genetically 
engineered trees in Brazil could encourage their 
adoption elsewhere. “It would have ripple effects 
worldwide,’ says Zander Myburg, who studies 
the genetics of forest trees at the University of 
Pretoria in South Africa. “Everybody will pay 
attention.” 

So far, no genetically modified tree from a 
major commercial species has been deployed on 


alarge scale. The ubiquity of eucalyptus makes 
Brazil’s decision on the modified trees a special 
concern to environmental activists who oppose 
the use of genetically modified crops. 

“They have become the target of very 
intensive and emotionally charged debate 
particularly among the NGOs and nature 
constituencies,” says Walter Kollert, a forestry 
officer with the Food and Agriculture Organiza- 
tion of the United Nations in Rome. 

A consortium of activists opposed to the plan 
intends to present a letter at the 4 September 
meeting, urging Brazil’s National Techni- 
cal Biosafety Commission to reject the trees. 
In all, 259 organizations — 106 of them from 
Latin America — have signed the letter, which 
expresses concern that the trees pose risks to the 
environment and will encourage the expansion 
of plantations. 

The trees were developed by FuturaGene, a 
biotechnology firm in Rehovot, Israel, that was 
spun out of the Hebrew University in Jerusa- 
lem in 1993. The company found that certain 
proteins accelerate plant growth by facilitating 
cell-wall expansion. FuturaGene inserted into 
eucalyptus a gene that encodes one such pro- 
tein from thale cress (Arabidopsis thaliana), a 
common laboratory plant. In 2010, the firm was 
bought by Suzano Pulp and Paper of Sao Paulo, 
Brazil, one of the world’s largest producers of 
eucalyptus pulp. 
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FuturaGene’s chief executive Stanley Hirsch is 
quick to point out the environmental benefits of 
his company’s creation. The tree's speedy growth 
boosts absorption of carbon dioxide from the 
air by about 12%, he says, aiding in the fight to 
reduce greenhouse-gas emissions. The geneti- 
cally modified trees may also require less land 
to produce the same amount of wood, reducing 
the conversion of natural forest into plantations. 

Hirsch says that the company has tried to 
avoid public-relations mistakes made by agri- 
cultural biotechnology companies in the past: 
rather than shun activists, he has invited them 
to tour the company’s field-trial sites. “Some of 
them were so surprised,” he says. “They said, 
‘Wow, these look just like normal trees” 

Hirsch’s pitch has not convinced everyone. 
Anne Petermann, executive director of the non- 
profit organization Global Justice Ecology Pro- 
ject in Buffalo, New York, says that FuturaGene 
is trying to stave off opposition by ‘greenwash- 
ing’ its product. Faster-growing trees require 
more water and extract more nutrients from 
the soil, she adds, and they will only add to the 
economic incentive to seed more plantations. 

Genetically engineered trees do pose some 
biosafety issues that do not apply to agricultural 
crops such as maize (corn) or soya, notes forest 
geneticist Steven Strauss of Oregon State Uni- 
versity in Corvallis. They remain in the envi- 
ronment for years, increasing their potential 
impact on the plants, animals and soil around 
them. And trees tend to disperse pollen further 
than crops nearer the ground do, raising con- 
cerns about gene flow to native relatives. But 
eucalyptus has no native relatives in Brazil and 
is not particularly invasive in most areas of the 
country, says Strauss. 

FuturaGene says that it identified no major 
environmental problems in eight years of field 
trials that collected data on everything from 
gene flow to leaf-litter decomposition to the 
composition of honey made by bees that visit 
the trees. Myburg, who does not work with 
FuturaGene but is familiar with the company’s 
safety data, says that he found the firm's studies 
to be well designed and thorough. 

While FuturaGene tests the waters in Brazil, 
a US company awaits a regulatory decision 
regarding its genetically engineered, freeze- 
tolerant eucalyptus. In 2008, ArborGen of 
Ridgeville, South Carolina, petitioned the 
US Department of Agriculture to allow com- 
mercialization of the trees in the southeastern 
United States. Delays of this length are not 
uncommon in the US regulatory system, says 
ArborGen’ director of regulatory affairs Leslie 
Pearson. 

For now, just the prospect that the trees 
might be approved has been enough to rally 
activists. “The fact that there are now two com- 
mercial applications has the movement against 
genetically modified trees mobilizing quickly 
in many regions,’ says Petermann. “We know 
were going to be seeing a lot more coming out 
from this industry.’ m 
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CONSERVATION 


Ecotourism rise hits whales 


Desire to observe whales and dolphins up close is affecting animals’ behaviour. 


BY DANIEL CRESSEY 


may increasingly be putting the survival 
of marine mammals at risk, conserva- 
tionists have warned. 

Research published this year shows that the 
jaunts can affect cetacean behaviour and stress 
levels in addition to causing deaths from col- 
lisions. But some animals are affected more 
than others and the long-term effects remain 
unclear, scientists at the International Marine 
Conservation Congress (IMCC) in Glasgow, 
UK, heard last week. 

“Whale-watching is traditionally seen as 
green tourism,’ says wildlife biologist Leslie 
New of the US Geological Survey in Laurel, 
Maryland. “The negative is the potential for 
disturbance. That disturbance is a worry 
because we don’t want to do ‘death by 1,000 
cuts?” 

The number of people joining trips has 
expanded hugely since the 1990s, from 4 mil- 
lion in 31 countries in 1991 to 13 million in 
119 countries in 2008, the most recent year for 
which full data are available. In 2008, the Inter- 
national Fund for Animal Welfare, an animal- 
protection charity in London, estimated the 
value of the industry at US$2.1 billion. 

Although collisions with boats can hurt the 
animals, researchers are more concerned about 
effects such as animals failing to feed or using 
up energy swimming away from the vessels. 


B oat trips to watch whales and dolphins 


Whale-watching trips often come into very close contact with the animals. 


These seemingly small events can add up, 
studies suggest. 

Earlier this year, for example, marine 
biologist David Lusseau of the University 
of Aberdeen, UK, and his team showed that 
minke whales (Balaenoptera acutorostrata) in 
Faxafldi Bay in Iceland responded to whale- 
watching boats as they do to natural preda- 
tors, upping their 


speed and respir- “The disturbance 
ing more heavily’. 1sa@worry 
But whether this because we don’t 


want to do ‘death 
by 1,000 cuts’.” 


was a direct result 
of the boats is dif- 
ficult to pin down: 
Lusseau, who was not at the meeting, says that 
soon-to-be-published research by his team 
shows that behavioural changes are probably 
not affecting actual numbers of the minke in 
Faxafldi Bay. 

But Lusseau’s group has also shown that the 
bottlenose dolphins (Tursiops sp.) in Doubt- 
ful Sound, New Zealand, could be driven to 
extinction in decades’. The large number of 
dolphin-watching trips in the sound is driving 
the animals away from their preferred areas and 
forcing them to avoid boats instead of feeding. 
Dolphin numbers declined from 67 in 1997 to 
56 in 2005, the team found. 

Several delegates at the IMCC also described 
the effects on the roughly 70 endangered 
Irrawaddy dolphins (Orcaella brevirostris) 
living in the Mekong River between Cambodia 


“4 
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and Laos, which are hounded by scores of 
tourist boats. 

Determining which populations are most 
at risk could help to fix the problem, says 
Lusseau. He suggests plugging short-term 
observational data into longer-term popula- 
tion models to tease out whether behavioural 
changes are temporary or serious long-term 
threats. There are enough data on species 
types and locations to assess, at least roughly, 
where whale-watching should and should not 
be allowed, he says. But funding and political 
support are hampering the creation of detailed, 
localized plans. “There is a lot of lip service 
being paid to understanding the challenges 
tourism poses on wildlife, but in practice there 
is very little financial interest in finding this 
out,” he says. 


SHORT-TERM FIX 

Guidelines such as specifying minimum 
distances between animals and boats, speed 
limits or no-go areas, can help. But codes vary 
widely: a 2004 study’ found that just 38% were 
binding; the rest were voluntary. They are 
also often inadequate. Even with guidelines 
in place, boats in the dolphin-watching haven 
of the Bocas del Toro region of Panama hit 
and killed at least 10 animals in a population 
of about 250 in 2012 and 2013, according to 
research presented to the International Whal- 
ing Commission in Cambridge, UK, this year. 

Greg Kaufman, executive director of the 
Pacific Whale Foundation — an organiza- 
tion in Hawaii that runs whale-watching and 
research trips — holds up the Irrawaddy dol- 
phin as a population desperately in need of 
protection. “They’re basically killing these 
animals one at a time,” he says. 

But Brian Smith, a zoologist at the Wildlife 
Conservation Society in New York who has long 
studied the Irrawaddy group, says that although 
cetacean tourism is probably stressful for these 
animals, the main problem is entanglement in 
fishing nets. And the alternative to fishing for 
many people in the region is dolphin-watching. 

Most of the speakers at the IMCC meeting 
agreed that more should be done to protect 
dolphins and whales from tourists. “Although 
whale-watching is not as bad as whaling,” says 
New, “it might be that last piece that pushes a 
species over.’ 

1. Christiansen, F., Rasmussen, M. H. & Lusseau, D. 

J. Exp. Mar. Biol. Ecol. 459, 96-104 (2014). 

2. Lusseau, D., Slooten, L. & Currey, R. J.C. Tourism 
Mar. Environ. 3, 173-178 (2006). 


3. Garrod, B. & Fennell, D. A. Ann. Tourism Res. 31, 
334-352 (2004). 
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Steep mountains in Himachal Pradesh, India, give rise to a rich variety of species. 


CLIMATE CHANGE 


Himalayan plants 
seek cooler climes 


Race is on to record mountain biodiversity before it is lost. 


BY T. V. PADMA IN SHIMLA, INDIA 


altitude are so dramatic and steep that allu- 

vial grasslands, subtropical forests, conifers 
and alpine meadows lie stacked almost on top 
of each other, producing a spectacular range of 
vegetation. Now, the myriad plants that inhabit 
these mountains are migrating upwards 
because of climate change — and some are in 
danger of being lost before anyone has even 
recorded their existence. 

“Indian scientists have not documented 
many of the species in the Western Himalayas, 
so one does not know what species existed in 
the first place, and where they have shifted to,” 
says Vaneet Jishtu, a botanist at the Himalayan 
Forest Research Institute in Shimla in the north 
Indian state of Himachal Pradesh. Jishtu and his 
team are midway through a five-year project to 
catalogue the upward migration. 

Although melting glaciers are often the focus 
of climate-change concerns in the Himalayas, 
the range is also home to one-tenth of the 
world’s known higher-altitude plant and animal 
species, and half of India’s native plant species. 
Particularly rich in biodiversity are the Western 
Himalayas that, in India, include the states of 
Himachal Pradesh, Jammu and Kashmir, Utta- 
rakhand and Sikkim (see ‘Peak plants’). There, 
mountains that rise steeply from 300 metres to 
more than 6,000 metres have diverse ecosystems 
and act as a natural barrier to species migra- 
tion. But data collection on biodiversity in the 
Himalayas is sporadic — and there is much less 


E: India’s Western Himalayas, changes in 


information than from the European Alps or the 
Andes, says Nakul Chettri, coordinator of the 
transboundary landscapes programme at the 
International Centre for Integrated Mountain 
Development (ICIMOD) in Kathmandu. 

The studies that do exist tend to be on well- 
known species, but give an idea of the extent of 
the upward shift. Over the past decade, tropical 
plants such as the nuisance weed Parthenium 
hysterophorus and the water fern Azolla cristata 
have settled in the formerly temperate climate 
of the Kashmir Himalayas, according to stud- 
ies by Zafar Reshi, a botanist at the University 


PEAK PLANTS 


Half of. India's native plant species are 
concentrated in its Himalayan states. 
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of Kashmir in Srinagar in Jammu and Kashmir 
state (see, for instance, B. Ahad et al. Am. Fern 
J. 102, 224-227; 2012). He also notes that the 
flowering and fruiting cycles of pear and apple 
trees — for which the region is famous — have 
shifted in line with rising temperatures during 
the past 50 years, leading to changes in fruit size, 
colour and taste (B. Basannagari and C. P. Kala 
PLoS ONE 8, e77976; 2013). 

Himalayan blue pines (Pinus wallichiana) 
are on the move too. They are now seen at 
heights of 4,000 metres, whereas two or three 
decades ago they grew at altitudes no higher 
than 3,000 metres, says a team led by Sher Singh 
Samant, head of the biodiversity and conserva- 
tion team at the Himachal Pradesh unit of the 
G. B. Pant Institute of Himalayan Environment 
and Development. “In the past, heavy snowfall 
in higher regions prevented the upward shift of 
species. Now there is less snow and it has started 
melting faster, bringing about changes in veg- 
etation and alpine meadows,’ he says. Even 
species among the glacier-deposited rocks at 
4,500-5,500 metres are moving upwards to 
cooler climes, he notes. 

Researchers know much less about — or have 
not even documented — many of the region's 
native species, especially ones that inhabit high- 
altitude transition zones, or ecotones, where 
one ecosystem gives way to another. Species 
in these zones are most affected by climatic 
change, says Chettri: “They either have to adapt 
by shifting to higher altitudes, or they die” 

Such fears prompted the launch in 2012 of 
Jishtu’s five-year project to assess the impact of 
global warming on transition zones in Himachal 
Pradesh. “We are still collecting and analysing 
data,” he says. He is also setting up an arboretum 
in Shimla to conserve Himalayan plants that are 
endangered or rapidly disappearing. 

One problem with existing Himalayan bio- 
diversity assessments is a tendency to limit 
them to protected areas and certain species, 
says Chettri. This affects data on animals as 
well as on plants. He cites a draft ICIMOD 
analysis of the Eastern Himalayas that noted 
the cataloguing of biodiversity there is skewed 
towards charismatic species such as the red 
panda (Ailurus fulgens), with lower vertebrates 
and invertebrates sidelined. Even preliminary 
information about algae, fungi, insects, mosses 
and ferns is still scarce in the Kashmir Himala- 
yas in India, adds Reshi. 

The data that do exist can be hard for 
researchers to find. Past records of the region's 
biodiversity are scattered, inaccessible or lost 
in obscure natural history museums, says 
Reshi. “There is little systematic digitization 
of the collected data, and access to the data is 
limited, leading to duplication and misuse of 
resources and time,’ adds Chettri. 

And time could be in short supply: species 
can migrate upwards for only so long before 
they hit inhabitable terrain. The upper Hima- 
layas comprise mostly rocks, without soils to 
support plant growth. = 
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CRISIS COUNSELLORS 


Volcanic eruptions, oil spills and bacterial outbreaks allland in the laps of 
government science advisers, and put them to the test. 


BY ALEXANDRA WITZE, LAUREN MORELLO & MARIAN TURNER 


s a population biologist, John 

Beddington spent most of his career 

studying fisheries rather than wor- 

rying about volcanoes. But then 

came April 2010, when Beddington 

— the UK government chief scien- 
tific adviser — found himself having to figure 
out not only how to pronounce Eyjafjallajékull, 
but also what to do about the eruption of the 
Icelandic volcano. 

In the small hours of 14 April, the volcano 
had gone from its previous state — pictur- 
esquely spitting out lava — to violently spewing 
plumes of ash high into the atmosphere. Winds 
were blowing to the south and east, where the 
fine ash presented a threat to Europe's busy 
commercial airline routes. Suddenly scientists 
were scrambling not only to understand how 
much ash the volcano was generating, but also 


how it was spreading through the atmosphere 
and how much ofa risk it presented to aircraft. 
So Beddington gota call at his Cotswolds home 
summoning him to 10 Downing Street. “T sort of 
dusted off my brain and went into the meeting,” 
says Beddington, who is now at the University 
of Oxford. 

In the first week of the crisis, authorities pro- 
gressively closed airspace where the volcanic 
ash was billowing. Ultimately more than 300 
airports were shuttered across Europe, strand- 
ing some 8.5 million passengers and causing 
major economic losses to the airlines and busi- 
nesses that depend on them. Each country made 
the decision about its own airspace, which put 
Beddington front and centre of helping UK 
officials figure out what to do. 

When scientists enter government in the 
role of a scientific adviser or as the head of 
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The eruption of 
Eyjafjallajokull in 
Iceland stranded 

millions of people 
around the world. 


a science agency, they 
need to be prepared for 
the unexpected. Some of 
their most crucial con- 
tributions come during 
crises, a theme that will 
be explored on 28-29 August at a global sum- 
mit of science advisers in Auckland, New Zea- 
land. On the eve of that meeting, Nature takes a 
look at how such officials performed during the 
Eyjafjallajékull eruption, as well as the 2010 oil 
spill in the Gulf of Mexico and a deadly disease 
outbreak in Europe the following year. 

These cases show that science advisers have 
key roles in a crisis, especially in disseminating 
clear, reliable information to government lead- 
ers and the public. But at times, they struggle 
with the demands presented by disasters: rare 
events can take them by surprise, bureaucracy 


ORSOLYA HAARBERG/NATURE PICTURE LIBRARY 


WHP/ALAMY 


can strangle their attempts to respond and they 
often cannot keep pace with the evolving situa- 
tion. “We have to form a view about advice for 
the government,” Beddington says. “And we 
have to do that on a fairly quick time scale” 


FLIGHT RISK 


After leaving his meeting with the prime 
minister, Beddington began to round up a 
panel of volcanology and meteorology experts 
to form an Eyjafjallajékull-focused Scientific 
Advisory Group for Emergencies (SAGE), the 
UK government's main mechanism for gath- 
ering technical advice and passing it along to 
decision-makers in crises. 

The SAGE concept was born in the wake of 
the 1990s spread of bovine spongiform enceph- 
alopathy, or ‘mad cow’ disease. Beddington was 
the first UK chief science adviser to gather a 
SAGE group together, during a 2009 influenza 
pandemic. Because of that experience, he says, 
when the Eyjafjallajokull crisis began, “I knew 
the sort of people Id need” 

The SAGE volcanic ash group met for the first 
time on 21 April, after London's Heathrow air- 
port — the world’s busiest — had faced the can- 
cellation of more than 97% of its flights for five 
days straight. The group included Sue Loughlin, 
a volcanologist at the British Geological Survey 
in Edinburgh, who did her PhD on Eyjafjalla- 
jokull and had served in Montserrat, in the West 
Indies, during a deadly eruption there in 1997. 
Loughlin and others supplied basic information 
about the volcanos geological history and the 
pace of the ongoing eruption. 

Yet the ash cloud, and the crisis, moved faster 
than the advisory group. Pressured by airlines 


that wanted to resume flights, Europe's transport 
ministers had on 19 April quickly brokered 
revised operational guidelines. Planes were in 
the air again even as SAGE began to meet. 

After the initial eruption had quietened, the 
volcano continued to spew low levels of ash, and 
nobody knew whether the activity might pick 
up again. Three more times over the next two 
months, Beddington convened SAGE to assess 
technical details about the eruption and the like- 
lihood of more to come. Among other things, 
the group explored whether a nearby volcano 
named Katla might also erupt, as it has in the 
past along with Eyjafjallajékull. The advisers 
passed that information to the Cabinet Office, 
which used it to develop scenarios for future 
volcanic-ash emergencies. 

SAGE also pushed government departments 
to assess the risk of future, larger volcanic erup- 
tions. In 2012, the Cabinet Office added Icelan- 
dic eruptions to Britain’s National Risk Register, 
the official list of possible events that could dis- 
rupt society. “I had not thought ofit at all before 
then,’ says Beddington. “It was very embar- 
rassing.’ The Cabinet Office is also working up 
a detailed scenario for how to respond in the 
face ofan eruption that could spew sulphur and 
other toxic gases across Britain, as the Icelandic 
volcano Laki did for eight months in 1783-84. 

David Alexander, a risk expert at University 
College London, says that the Eyjafjallajékull 
experience improved some aspects of disas- 
ter response in the country. The International 
Civil Aviation Organization, for example, has 
updated and clarified its guidelines on how 
much ash planes can fly through safely. And the 
UK Met Office has fine-tuned its atmospheric 
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models for predicting the spread of dry ash 
through the air. 

But nearly all parts of the government took 
much too long to respond to the crisis, Alex- 
ander says. And he notes that even now, no 
coordinated plan exists to manage alternative 
transportation, such as the ferries, trains and 
taxis that became overloaded in April 2010. 
“There is still no adequate way for dealing with 
millions of stranded people; Alexander says. 

Before Beddington left office in 2013 — 
replaced by Mark Walport — he activated the 
SAGE mechanism once more, this time to 
provide advice about whether to evacuate the 
British nationals in Japan after the 2011 melt- 
down at the Fukushima nuclear power plant. 
SAGE modelled how radioactive material might 
spread and concluded that the risk of being 
exposed to radiation was relatively modest. In 
the end, the government provided iodine tablets 
as a precautionary measure but told its nationals 
they could stay put. 

And just in case, Beddington also ran some 
tabletop exercises for a major space weather 
event that could blow out power grids as well as 
other events so alarming that he prefers not to 
even name them. “If any of these instances had 
happened,” he says, “we'd have been in position 
to pulla SAGE team together.” 


DEEP TROUBLE 


Even as the volcanic ash cloud was spreading 
over Europe, science officials in the United 
States were struggling with their own crisis, 
one of the biggest ecological disasters in the 
nation’s history. It all started just after 9 p.m. 
on 20 April 2010, when an engineer aboard 


During the 2010 Gulf oil spill, President Barack Obama met with scientists Steven Chu (far left), Jane Lubchenco and John Holdren, and coast guard 
commandant, Thad Allen. 
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the BP Deepwater Horizon oil rig in the Gulf 
of Mexico noticed an odd vibration. Minutes 
later, the rig exploded, killing 11 men and 
beginning a months-long effort to stanch the 
flow of oil from a damaged well on the sea floor 
and avert an environmental catastrophe. 

By the time the leak was first plugged in 
early August 2010, an estimated 4.9 million 
barrels had gushed into the gulf — surpassing 
all previous marine spills — and in so doing 
had put US President Barack Obama's vaunted 


between 25,000 and 100,000 barrels per day. 
“The estimates that kept coming up formally 
from the agencies were to me just too low,” 
says Kate Moran, an oceanographer then 
working as a senior policy analyst at the OSTP. 
That error created problems for BP when it 
tried to cover the gushing wellhead with an 
iron dome, because the cap was unable to 
contain the volume of escaping oil and gas. 

It took until 27 May for the committee 
assembled by the administration to deter- 


"AT MANY CRITICAL JUNCTURES IN THE PROCESS, 
THE GOVERNMENT WAS ON THE WRONG SIDE OF HISTORY.” 


first-term science team to its toughest test, one 
for which it has received mixed marks. 

Deepwater Horizon was a daunting disas- 
ter, with crude oil gushing from a reservoir 
of unknown size at a depth of 1,500 metres. 
But for more than a month after the blowout, 
the administration vastly underestimated the 
amount of oil flowing into the gulf — a mis- 
take that hampered efforts to cap the leaking 
well and undermined public confidence in the 
president's response to the crisis. Key science 
officials, including presidential science adviser 
John Holdren, were slow to correct the errone- 
ous estimates — even as academic scientists 
argued that the spill was orders of magnitude 
larger than BP or the government had publicly 
stated. 

On paper, the crisis seemed tailor-made for 
the all-star group of scientists that Obama had 
assembled after he took office in 2009. Hol- 
dren, a physicist, directed the White House 
Office of Science and Technology Policy 
(OSTP), while Jane Lubchenco, a marine 
ecologist, helmed the National Oceanic and 
Atmospheric Administration (NOAA). Geo- 
physicist Marcia McNutt had stepped in to 
lead the US Geological Survey (USGS). And 
the Energy Department boasted Nobel-prize- 
winning physicist Steven Chu as its leader. 

Holdren says that the government sought 
help from outside scientists within hours of 
the explosion. But in many respects, Obama 
and his science team moved too tentatively. 
It was not until 19 May — almost a month 
after the blowout — that the administration 
assembled a group of scientists and engineers, 
headed by McNutt, to revise the controver- 
sial flow-rate calculations. The oil flow was 
extremely hard to estimate because there was 
no direct way to measure the well’s output, 
and industry and research scientists disagreed 
about how best to do it. 

NOAA had long maintained that the flow 
was about 5,000 barrels a day, but independent 
scientists examining satellite imagery of the 
growing oil slick and BP video of the under- 
sea well had argued that the actual output lay 


mine that the flow rates determined by NOAA 
and BP were indeed too low. The group esti- 
mated that 12,000-19,000 barrels of oil were 
spilling into the gulf each day. Ultimately, the 
government would arrive at a much larger fig- 
ure — 62,000 barrels a day immediately after 
the blowout, dwindling to 53,000 barrels a day 
in August, before a temporary seal stopped the 
flow. 

“The most difficult part about it was try- 
ing to understand what we were getting from 
BP, and whether we really understood the 
possible sources of error,’ says McNutt, who 
arrived at BP’s oil-spill operation centre on 
6 May and remained for the duration of the 
crisis. “It actually took some time, and maybe 
too much time, to realize the magnitude of the 
problem,’ says Larry Mayer, an oceanographer 
at the University of New Hampshire in Dur- 
ham. He argues, however, that the response 
quickly improved in mid-May, when Holdren 
and other officials met with scientists from 
some of the country’s top oceanography pro- 
grammes, seeking access to equipment such 
as remotely operated underwater vehicles and 
ways to track the oil as it spread in a plume 
below the surface. 

That plume was a source of great grief for 
NOAA chief Lubchenco, whose agency was 
mandated by US law to assist the Coast Guard 
in tracking the oil’s path through the Gulf of 
Mexico and monitoring its effects. In mid- 
May, academic researchers reported finding 
masses of microscopic oil droplets 1,000-1,400 
metres below the ocean surface, spreading out- 
ward for tens, possibly hundreds, of kilometres 
from the leaking wellhead. Almost immedi- 
ately, Lubchenco issued a statement calling 
reports of those findings “misleading, prema- 
ture, and in some cases, inaccurate’, drawing 
a wave of criticism from oceanographers who 

felt that she was unduly 
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oceanographer,” says Ian MacDonald from 
Florida State University in Tallahassee. It took 
another 22 days before NOAA acknowledged 
the presence of the plume — a delay that fed 
mistrust of the agency by outside scientists. 

By contrast, McNutt and Chu drew praise 
for their actions during the crisis. While 
McNutt helped to determine the amount of oil 
leaking into the gulf, Chu worked to stop the 
flow. He arrived in Houston on 12 May, accom- 
panied by distinguished scientists recruited 
with Holdren’s help, including some from 
JASON, a storied panel that advises the gov- 
ernment on issues such as defence and energy. 
They and others quickly began challenging BP 
for more and better data about the state of the 
well. Chu convinced the oil company to moni- 
tor the wellhead using y-ray imaging as well 
as temperature and pressure gauges, which 
provided the first direct measurements of the 
still-flowing oil. 

Finally, on 19 September — 150 days after 
the initial explosion and well blowout — the 
Coast Guard declared an end to the disaster 
after engineers inserted a cement plug to per- 
manently seal the well. Four years later, disap- 
pointment still lingers among many scientists 
about the way in which the Obama administra- 
tion handled Deepwater Horizon. Stopping the 
oil’s flow “was a huge effort, and people worked 
heroically and tirelessly’, MacDonald says. But, 
he argues, the scientists leading government 
agencies should have acted more quickly and 
provided better information about the extent 
and nature of the spill. “At many critical junc- 
tures in the process,” he says, “the government 
was on the wrong side of history.” 


OUTBREAK 


A year after the Deepwater Horizon spill, doc- 
tors in Hamburg, Germany, put outa call for 
help. On 19 May 2011, three children at a city 
hospital were battling haemolytic uraemic syn- 
drome, a life-threatening condition caused by a 
severe gastrointestinal infection with the bac- 
terium Escherichia coli. Worried that the out- 
break could spread, Hamburg health officials 
contacted the Robert Koch Institute (RKI) in 
Berlin — Germany’s federal agency for disease 
control and prevention. 

When three RKI epidemiologists arrived in 
Hamburg the next day, it was clear that some- 
thing unusual was going on. Several other cases, 
including some in adults, had popped up at hos- 
pitals around the city and reports soon came in 
from other regions. What followed was Europe's 
worst recorded outbreak of E. coli infection. By 
the time the outbreak was declared over, some 
two months later, more than 3,800 people had 
developed acute gastrointestinal infections. Of 
those, 845 had progressed to HUS, and 54 had 
died. The correct source of the infection — fen- 
ugreek seeds from Egypt — was not identified 
to the public until 5 July. 

Reinhard Burger, president of the RKI, says 
that the outbreak “was a good example of how 


TOBIAS SCHWARZ/REUTERS/CORBIS; HERMANN BREDEHORST/POLARIS/EYEVINE 


rapidly a new threat can appear and develop 
and affect the population”. The event also 
illustrated the problems that can emerge when 
scientists at multiple agencies — reporting to 
different levels of government — respond to a 
public-health emergency. 

The RKI’s most pressing task was to identify 
the strain of bacterium that was causing the 
infection and then where it came from. E. coli 
infections are typically caused by contami- 
nated foods, and the epidemiologists began 
by interviewing patients about their recent 
diet. “We knew within the first two days or 
so that the usual suspects — fresh milk and 
raw-meat products — were not the problem,” 
says Burger. The only thing all patients seemed 
to have in common was that they had eaten 
salads containing fresh tomatoes, cucumber 
and lettuces, in northern Germany. On 25 May, 
the RKI, which reports to the federal health 
ministry, issued a joint statement with the fed- 
eral institute for risk assessment (BfR), saying 
that these vegetables were associated with 
increased risk of infection. 

Science advisers at the RKI say that the 
various federal agencies responsible for dis- 
ease control and food safety worked together 
smoothly to assess the risks and communi- 
cate them to the public. But the coordination 
between them and agencies that reported to 
state governments was not nearly as efficient. 
As cases emerged throughout Germany, the 
chain of orderly communication cracked. The 
worst blunder came on 26 May, when Ham- 
burg health senator Cornelia Priifer-Storcks 
announced that the Hamburg Institute for 
Hygiene and Environment had discovered 
enterohaemorrhagic E. coli (a pathogenic 
class of E. coli) on Spanish cucumbers. But she 
had not consulted the RKI before making that 
statement. And tests a few days later revealed 
that the bacteria on the suspect cucumbers did 
not belong to the same strain found in patients. 

The ongoing uncertainty about the source 
of the bacteria led to trade restrictions and 
large economic losses, particularly in Spain, 
where farmers found themselves unable to 
export cucumbers and other suspect produce. 
The European Commission eventually issued 
a €227-million (US$302-million) payout to 
farmers from several countries. 

Burger says that the Hamburg statement 
“damaged confidence in public announce- 
ments”. The problem was compounded by the 
public’s increasing hunger for information 
while epidemiological investigations were 
failing to turn up more-specific leads. “I was 
facing the media every day, but sometimes we 
had nothing new to say,” he says. 

Looking back, health officials say that some 
technological improvements could help. 
For instance, Burger says that using genetic 
sequencing rather than the current culture- 
based diagnostic techniques might have 
helped physicians to recognize the outbreak 
more quickly. 
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Local authorities in Germany incorrectly identified Spanish cucumbers as the source of deadly E. coli 
during an outbreak in 2011. Reinhard Burger, who heads the nation’s federal agency for disease control 
and prevention, and other health officials worked to clear up the confusion. 


The response was also hampered by a 
complicated reporting chain: when physicians 
confirm a case ofa disease, they report it to local 
health authorities, who forward it to the RKI. 
Before the outbreak, that process could take up 
to 16 days. A change to the law last year means 
that reports must now reach the RKI within 
5 days. But the notifications can still trickle in by 
phone, fax or e-mail. The federal health minis- 
try is therefore developing an electronic system 
to provide faster and simultaneous notification 
for local and federal authorities. 

Such a system would not bypass the prob- 
lem that the RKI can act only at the request of 
state agencies — arule that some feel should be 
changed. “When there is the impression than 
an outbreak is affecting more than one state, 
the RKI should have the right to start inves- 
tigations on its own,’ says Ulrich Frei, medi- 
cal director of the Charité hospital in Berlin, 
which handled several cases. “Health is one of 
only a couple of topics in which the German 
states still have much authority, and they’re 
not very willing to transfer this to the federal 
level,” he says. 


As with natural disasters and human-caused 
crises, the outbreak points out the benefits of 
high-level coordination during emergencies. 
Peter Gluckman, New Zealand's chief science 
adviser, grappled with this issue when an earth- 
quake levelled much of Christchurch in 2011. 
In the aftermath, competing scientific experts 
debated future risks in the media in a way that 
spurred confusion among government offi- 
cials and the public, says Gluckman. His office 
spent weeks trying to get scientists to provide 
clear information about existing threats and 
uncertainties. A crisis demands scientific coor- 
dination, he says. “Often it is made worse by 
inconsistent communication”. 


Alexandra Witze, who wrote about 
Eyjafjallajokull, reports for Nature from 
Boulder, Colorado, and is the co-author of 
Island on Fire, a book about an Icelandic 
volcanic eruption in 1783. Lauren Morello, 
an editor with Nature in Washington DC, 
wrote about the oil spill. Marian Turner, an 
editor with Nature in London, covered the 
E. coli outbreak. 
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Residents in the flood-prone district of Kurigram, Bangladesh, move a community mosque to safer ground. 


Model human adaptation 
to climate change 


We can no longer ignore feedbacks between global warming and 
how people respond, say PaulI. Palmer and Matthew J. Smith. 


[ vet models of Earth’s climate 
capture physical and biophysical pro- 
cesses. But the planet has entered a 

new state: humans are adapting to, as well as 

causing, environmental changes. This major 
feedback must be modelled. Projections of 
the future climate based on simple economic 
narratives’ — from cuts in greenhouse-gas 
emissions to unmitigated growth — are 
unrealistic. 

Faced with droughts and rising sea levels, 
people alter their behaviour. Even if global cli- 
mate policy is effective, and nations deliver 


on ambitious green-energy-production 
and sustainability targets, societies will be 
different in a warmer world. People will move 
to places that are richer in resources, or stay 
where they are and be pushed further into 
poverty. Population growth, urbanization, 
migration’ and conflict’ will compound reac- 
tions to global temperature rises. 

To understand how events might unfold 
and what kinds of responses will be most 
effective, Earth-system models need to cap- 
ture human-climate dynamics. It will be an 
enormous challenge: we are only beginning 


to understand how people respond to their 
environment. But omitting human behav- 
iour is like designing a bridge without 
accounting for traffic. 


DECISIONS, DECISIONS 

There are two main scientific challenges 
to modelling socio-economic responses 
to climate change. The first is describing 
how humans make decisions. The second 
is describing the relationships between 
humans and the physical and biophysical 
components of the Earth system. > 
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>» Decision theory approximates how a 
person or group makes a choice on the basis 
of their values, aspirations, health, uncer- 
tainties and rationality. Values may vary 
according to, for example, age, wealth, edu- 
cation, sex, culture and religion. One group 
of people might see adopting nuclear power 
as a practical way to avoid greenhouse-gas 
emissions; another would find it unpalatable. 
Choices, such as whether to move or stay ina 
location as average temperatures rise, will be 
influenced by events such as droughts, fires 
or floods. Outcomes and pay-offs might be 
as great as life or death. 

To understand the underlying patterns, 
we need to collect behavioural statistics on 
grand scales. How do people of different 
backgrounds respond to extreme weather, 
for example? Under severe drought, do 
people in sub-Saharan Africa behave dif- 
ferently from those in southern Australia? 
How do the decisions made by lower- and 
middle-income families differ? 


ASSESSING INFLUENCE 

Some statistics can be drawn from existing 
databases, such as health and education data 
compiled by the United Nations or national 
censuses. Measures of social and physical 
mobility can be extracted from social media 
and Global Positioning System (GPS) infor- 
mation from mobile phones without raising 
privacy issues. A 2008 study found that the 
distances travelled in six months by 100,000 
mobile-phone users were not random, but 
were well described by a simple function’. 

Historical studies can inform us about 
possible future scenarios. For instance, 
Peruvian civilizations migrated between 
coastal valleys and the Andean highlands 
between ap 640 and 1200. Historical climate 
records and mitochondrial DNA sequences 
extracted from the bones of people who lived 
there showed that coastal droughts and flash 
floods pushed the populations to move to the 
mountains, from where they returned when 
droughts afflicted those lands’. Other studies 
have reported a higher incidence of conflict 
since 1950 among communities experi- 
encing changes in climate — cases rose by 
a few per cent for each standard-deviation 
increase in temperature or rainfall’, 

More surveys and longitudinal studies are 
needed to find out what sorts of risks indi- 
viduals say they will react to most strongly, 
and how they actually respond. For example, 
a study of migration intentions in rural Paki- 
stan from 1991 to 2012 showed that extreme 
heat influenced people's choices but flood- 
ing did not®. The main reason was that heat 
waves do not attract as much financial com- 
pensation as flood relief. 

We must also find out which measure- 
ments can act as reliable indicators for a 
broad spectrum of behaviours. For example, 
income can be a strong predictor for rates of 


migration, conflict and reproduction, but 
alone, income is insufficient to predict any 
one of these behaviours. 

And individuals do not always make 
rational or independent decisions. People 
often rely on simple rules to form judge- 
ments’. They might assume that others 
know better than them and thus follow social 
norms. Friends and relatives may hold sway, 
and news, online and social media are shared 
sources of information and influence. 

Alternatively, individuals might act 

independently and 


“How do rationally in the short 
people of term, yet collectively 
different destroy a shared 
backgrounds resource in the long 
respond to term, as with climate 
extreme change. People often 
weather?” have to make deci- 


sions with too little or 
too much information. Taking out insurance 
against flooding, for instance, might have to 
be done without knowing the future likeli- 
hood of extreme rainfall. 

It is therefore essential to understand how 
the individual decisions combine across net- 
works to produce macroscopic behaviours. 
A challenge for modellers will be to find 
which scales of social structure (govern- 
ments, parties, tribes, clubs or neighbour- 
hoods) exert most influence, as well as 
accounting for regional differences. 

A range of modelling techniques will be 
necessary. One is the agent-based approach. 
Thousands or millions of ‘agents’ — digital 
proxies for individuals — encoded with 
attributes such as age or wealth inter- 
act according to simple rules. Complex 
group responses, such as the dynamics of 
infectious-disease spread, can emerge. But 
outcomes depend on which attributes are 
assigned so modellers need to ensure that 
the chosen variables are the key determi- 
nants of a particular response in real life. 
Rather than be a response to the physical 
effects of heat, a war in a warming region 
might reflect the decline in local economic 
conditions because of crop failure or an 
influx of people from a neighbouring region. 

Inference methods (such as approximate 
Bayesian computation) could be used to 
discover which individual attributes contrib- 
ute most to group behaviour. Methods for 
modelling groups of individuals with simi- 
lar characteristics can still generate emergent 
mass responses, revealing, for example, that 
the contrasting dispersal of animals on land 
and in oceans contributes to the observed 
contrasts in ecosystems’. 


TAKE STOCK 

Three main things are needed to model 
human responses to climate change: interdis- 
ciplinary research, appropriate computational 
and conceptual frameworks, and better data. 
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Collaborations between natural and social 
scientists should be facilitated by physical 
and virtual centres. A hub such as the Isaac 
Newton Institute for Mathematical Sciences 
in Cambridge, UK, where mathematicians 
come together to solve problems that require 
insights from many fields, could focus efforts 
for developing human-climate population- 
modelling techniques. 

Journals and funding bodies will need 
to smile more on such collaborations. The 
next (sixth) report of the Intergovernmental 
Panel on Climate Change should justify the 
importance of improved understanding and 
modelling of societal responses to climate 
change by reviewing the evidence across 
different working-group reports. 

Scientists will need to obtain new data 
sets and consolidate existing ones on 
decision-making processes associated with 
environmental change. With so many fac- 
tors, new approaches will be needed along- 
side conventional methods to collect data 
across a spectrum of spatial and temporal 
scales. This will involve data collection 
through social media, surveys and GPS, 
and will require a degree of analytical inte- 
gration that will challenge computational 
scientists. 

A parallel issue is how to store and analyse 
large volumes of heterogeneous data. Much 
data relating to individuals (such as social- 
network structures and movement patterns) 
will come with sensitive commercial or pri- 
vacy issues. Careful licensing arrangements 
between businesses, governments and aca- 
demic institutions will be needed. 

Ultimately, we must establish an interna- 
tional data-collection effort involving the 
public, private and voluntary sectors. Much 
as we take global stock of forests or biodiver- 
sity, we should regularly assess how people 
are being changed by the climate that they 
are changing. = 


Paul I. Palmer is professor of quantitative 
Earth observation at the University of 
Edinburgh, UK. Matthew J. Smith is an 
ecologist in the Computational Science Lab 
at Microsoft Research in Cambridge, UK. 
e-mail: pip@ed.ac.uk 
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An artist’s impression of the Milky Way — one of hundreds of billions of galaxies in the Universe. 


ASTROBIOLOGY 


Cosmic prestige 


Mario Livio welcomes a lucid description of attempts to evaluate how special humans are. 


ll the astronomical discoveries made 
A Nicolaus Copernicus demoted. 

Earth from its position at the cen- 
tre of the Universe have continued to erode 
humanity’s perceived physical significance 
in the grand scheme of things. Consider this 
sequence of events: in 1920, US astronomer 
Harlow Shapley showed that the Solar System 
does not occupy the centre of the Milky Way, 
but is about two-thirds of the way out. Then, 
Edwin Hubble discovered that there are many 
other galaxies — a few hundred billion in the 
observable Universe, according to the latest 
observations (made, fittingly, by the Hubble 
Space Telescope). Next, it was found that even 
the stuff we are made of — ordinary baryonic 
matter — constitutes less than 5% of the Uni- 
verse’s energy budget. To top it all, speculative 
models based on cosmic inflation and string 
theory suggest that our entire Universe may 
be but one member of a ‘multiverse’ a huge 
ensemble of some 10°” universes. 


On the planetary 
scale, there has also 
been an explosion 
of discoveries. Until 
1992, there had been 
no confirmed dis- 
coveries of any plan- 
ets outside the Solar j 
System. However, y 
observations since The Copernicus 
then (especially by the | Complex: The 
Kepler satellite) sug- Quest for Our 
gest that about 20% Cosmic (In) 
of all Sun-like stars Sieniliganie 

CALEB SCHARF 

in our Galaxy har-  ajjen Lane/Farrar, 
bour approximately — Straus and Giroux: 
Earth-sized planets 2014. 
orbiting in the stars’ 
‘habitable zones’ — the regions of space that 
are neither too hot nor too cold, allowing 
liquid water to exist on a planet’s solid sur- 
face. Given that liquid water is considered 
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potentially a necessary ingredient for life, 
these statistics are (at the very least) promis- 
ing for those who believe that there could be 
life elsewhere. 

That Earth hosts life remains its last quali- 
fication for being special. How reasonable 
is it to think that we are alone in the vast 
expanses of space? And how significant 
is life on Earth on the Universal (or multi- 
versal) scale? These are the questions that 
astrobiologist Caleb Scharf addresses intelli- 
gently and comprehensively in his beautifully 
written The Copernicus Complex. The book 
offers a grand tour of important findings 
from astronomy to biology that are relevant 
to the cosmic and microscopic search for life. 

What sets this book apart from those that 
simply describe the hunt for exoplanets is 
Scharf’s emphasis on the significance, or lack 
thereof, of our own existence. For instance, 
the realization that the human body contains 
ten times as many microbial cells as human 


NASA/ESA/Z. LEVAY (STSCI/AURA) 


cells — as well as impressive advances in the 
understanding of the chemical origin of life 
— has forced us to rethink how we classify the 
‘importance’ of life forms on Earth, and per- 
haps even to consider placing microbes at the 
top of the hierarchy, rather than at the bottom. 
Scharf tackles in some detail the question 
of whether we can conclude anything about 
the expected frequency of extraterrestrial life 
from the known facts about the emergence 
and evolution of life on Earth, particularly 
two important clues. The first is that some 
form of life arose 


“Our place in very early in Earth's 
the universe is history, within only 
special but not a few hundred mil- 
significant, lion years of the 
unique but not planet’s formation; 
exceptional.” the second, that 


the appearance of 
‘intelligent’ beings took a few billion years. 
After sketching the basics of Bayesian prob- 
ability theory, Scharf describes the interesting 
results of astrophysicists David Spiegel and 
Edwin Turner. They have shown that in the 
absence (so far) of any evidence of life aris- 
ing independently of our lineage, one cannot 
reach any conclusions about the rarity (or 
not) oflife in the Universe. This highlights the 
importance of the search for that evidence. 

Scharf ends his book with the reflection 
that life inhabits the border between order 
and chaos. For instance, the dynamics of the 
planetary orbits in our Solar System are so 
complicated that they may become unstable 
within a few billion years. Similarly, Earth’s 
climate and geophysics occupy that interface 
between order and disorder. From that, Scharf 
concludes that “our place in the universe is 
special but not significant, unique but not 
exceptional”. Note, however, that from the 
perspective of thermodynamics (entropy), 
life itself is an extremely ordered system. 

I see two other important messages, with 
which I am sure Scharf would agree. One is 
that given the number of space telescopes 
that either are being built (such as the James 
Webb Space Telescope, to be launched in 
2018) or have been proposed (including 
the Advanced Technology Large-Aperture 
Space Telescope), we may, for the first time 
in human history, be close to determining 
whether extraterrestrial life exists. Some 
optimistic estimates predict such a discov- 
ery in the next two decades. The second is 
that, notwithstanding our physical insig- 
nificance, the human mind is significant. 
Why? Because all the discoveries described 
in this book, from the subatomic realm to 
the multiverse, were made by us. m 


Mario Livio is an astrophysicist at the Space 
Telescope Science Institute in Baltimore, 
Maryland. His most recent book is Brilliant 
Blunders. 

e-mail: mlivio@stsci.edu 
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Sapiens: A Brief History of Humankind 

Yuval Noah Harari HARVILL SECKER (2014) 

This newly translated Israeli best-seller by historian Yuval Noah 

Harari delivers a boldly synthesized account of Homo sapiens’ rise 

through the hominin ranks, by way of the cognitive, agricultural 

and scientific revolutions. Harari offers some original reframing 

of phenomena such as symbolic thinking. A leitmotif of cruelty in 

domains from war to livestock rearing also emerges — perhaps 

unsurprisingly in a species that, Harari argues, reached the top of 

y the food chain by acting like the dictator of a banana republic. A view 
of our ascent as nasty, brutish, long — and endlessly fascinating. 


A Prescription for Psychiatry: Why We Need a Whole New 
Approach to Mental Health and Wellbeing 

Peter Kinderman PA 
The furore over the 
Association’s Diagnostic and Statistical Manual of Mental Disorders 
showed anew the ri 
medical interventio 
the fray. Arguing against biomedical reductionism, he offers a social 
and psychological model of mental illness, and calls for reforms 
such as multidiscip 
big societal changes to promote mental well-being. 


LGRAVE MACMILLAN (2014) 
2013 edition of the American Psychiatric 


fts in psychiatry over diagnostic hair-splitting and 
ns. Here, psychologist Peter Kinderman enters 


inary care, reduced use of pharmaceuticals, and 


The New Moon: Water, Exploration, and Future Habitation 

Arlin Crotts CAMBRIDGE UNIVERSITY PRESS (2014) 

Is lunar exploration an old story? Pushing aside the political hurdles 
that impede a US return to the Moon, astrophysicist Arlin Crotts 
mines lunar research and its implications for human colonization in 
staggering, often deeply engaging, detail. Beginning with a scientific 
portrait of Earth’s satellite, he probes missions from the cold war and 
international space activity since then; delves into findings on lunar 
chemistry, the Moon’s far side and aspects such as outgassing and 
moonquakes; and explores in technical but accessible detail what we 
can glean from all this regarding a human presence on the Moon. 


The Bee: A Natural History 

Noah Wilson-Rich \vy (2014) 

Well over 100 million years ago, flowering plants debuted on Earth 
— and early wasps began to co-evolve with them into the bee. As key 
pollinators and providers of honey and wax, bees have buzzed their 
way into human history. But the natural history of solitary, bumble, 
honey and stingless bees is as gripping as our lengthy alliance, as 
urban beekeeper Noah Wilson-Rich and contributors show in this 
charming compilation. They cover evolution, biology (including a 
unique proboscis made of two organs), behaviours (such as honey 
bee “quacking”), the causes of catastrophic die-offs, and more. 
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Curious: The Desire to Know and Why Your Future Depends On It 
lan Leslie BASIC Books (2014) 

Deep questing is essential to keeping your cognitive edge and social 
intelligence razor-sharp throughout life, posits lan Leslie. He weaves 
ample science into his exploration; the brain’s caudate nucleus, 

for example, is associated with romantic love as well as the urge to 
learn, putting the latter “onto the same pathway as our most primal 
pleasures”. Staying curious into old age, he argues, demands that 
we imitate Benjamin Franklin: “sweat the small stuff while thinking 
big”, and never stop experimenting. Barbara Kiser 
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Transgressive treats 


Paul L. McEuen relishes Margaret Atwood’s acerbic 
tales of sex, hallucinations and death by stromatolite. 


Mattress, Margaret Atwood’s fine new 

collection of nine dark, witty tales. The 
first three form a mini-trilogy; the remain- 
ing six constitute a smorgasbord of horror 
and crime inflected with science, not least 
characters with rare medical conditions, and 
a murderous use for one of Earth’s oldest 
fossils. 

Many of Atwood’s most famous works, 
such as The Handmaids Tale (1985, McClel- 
land and Stewart) or the MaddAddam tril- 
ogy, look at individuals struggling under 
the yoke of a technologically fractured, 
dystopian society. Stone Mattress follows in 
this tradition, but the oppressor here is sex, 
along with its dark partner, death. 

Sex is an innovation as old and 


S" and death are the pole stars of Stone 


revolutionary as 
eukaryotic life itself. 
Each new generation 
is a risky shuffle of the 
genome, rolling the 
dice of variation in 
spectacular fashion. 7 
To complete the deal, 
the previous model 
needs to exit the stage 


Sto. } 
TON | 
“Sater RES Vv 


Stone Mattress: 


—a strategy that is Nine Tales 
successful for the spe- MARGARET ATWOOD 
cies, but wreaks havoc Bloomsbury/Nan 


on the individual. 4 /alese: 2014 


So the trio of stories 

at the start centres on elderly ex-lovers for 
whom death lies directly ahead, and sex is a 
signpost in the rear-view mirror. Centring 
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on a widowed author of cult fantasy fiction, 
Constance, the first tale is ‘Alphinland’ — 
named after the river Alph in Samuel Taylor 
Coleridge’s poem Kubla Khan, a symbol of 
human mortality that flows into a “sun- 
less sea’. Alphinland is the fictional realm 
that Constance has crafted, with the artist’s 
perennial aim of cheating sex and death 
by creating alternate universes. ‘Revenant’ 
and “Dark Lady’ trace the final days of 
Constance’s once-youthful poet-lover and 
the insubstantial muse who succeeded her. 
Constance tries to imprison them all fiction- 
ally in Alphinland, but cannot halt time in 
the real world. 

From here the tales venture into stranger 
territory. In ‘Lusus Naturae; the genetic lot- 
tery goes awry, creating an erudite woman 
whose congenital abnormality makes her 
resemble a vampire. The eponymous ‘Stone 
Mattress’ was originally cooked up by 
Atwood as a campfire tale on a trip in the 
Canadian Arctic. In it, we follow the ageing 
Verna on a tour boat in the Arctic. She has 
madea career of marrying elderly men with 
weak hearts, then hastening their deaths by 
giving them tacit permission to satisfy every 
forbidden desire. Confronted by a sexual 
predator from her past, she graduates to 
full-on murder using a 2-billion-year-old 
stromatolite, the fossilized remains of bio- 
films created by some of the very earliest 
forms of life. 

Only once does Atwood turn to a larger 
stage, in “Torching the Dusties’ — my per- 
sonal favourite. In it we meet Wilma, who 
lives in a comfortable retirement commu- 
nity. She is afflicted with Charles Bonnet 
syndrome, in which visual hallucinations of 
remarkable clarity — in this case, colourfully 
dressed little people who dance on window- 
sills — emerge as sight fades. Wilma is doted 
on by Tobias, a courtly fading romantic. They 
live an idyllic life, until news begins to filter 
in of angry mobs torching retirement com- 
munities around the world. The dusties, as the 
mobs refer to the elderly, are taking up too 
much time, space and money. When crowds 
gather at the gate, Wilma and Tobias make a 
run for it — to what end, we can only imagine. 

Atwood is an author of remarkable gifts. 
Poet, storyteller and scientific acolyte, she 
ponders the inevitable end that awaits us all, 
and with great courage flushes her quarry out 
into the open. I feel about Stone Mattress the 
way I imagine Atwood feels about life: with 
the end in sight, I am ready to begin again. m 


Paul L. McEuen is the John A. Newman 
Professor of Physical Science at Cornell 
University in Ithaca, New York, and 
director of the Kavli Institute at Cornell for 
Nanoscale Science. His scientific thriller 
Spiral was named best debut novel of 2012 
by International Thriller Writers. 

e-mail: pmceuen@gmail.com 
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Uphold China’s plan 
for cleaning water 


We disagree with aspects of Tao 
Tao and Kunlun Xin’s plan for 
sustainably producing China's 
drinking water (Nature 511, 
527-528; 2014). 

In our view, the country should 
continue to improve its mains- 
water infrastructure: upgraded 
pipework is essential to its 
urbanization strategy (see X. Bai 
et al. Nature 509, 158-160; 2014). 

Projects are already under way 
for recycling water and cleaning 
water sources. In Beijing, for 
example, recycled water is used 
in industrial processes, for 
replenishing rivers and lakes, 
and for irrigation and domestic 
purposes. And Yuanping city 
in Shanxi province now has a 
purification plant for industrial 
wastewater. 

We question the feasibility 
of installing household water 
purifiers, a core feature of the 
authors’ plan. Each would cost 
around 1,500 renminbi (US$244), 
which amounts to 645 billion 
renminbi for all the families in 
China — much more than is 
needed to upgrade pipes and 
water-treatment plants. Door- 
to-door recycling of filters could 
prove harder to manage than 
centralizing purification of 
household water in treatment 
plants. Also, filters in domestic 
purifiers could be a health risk if 
not changed frequently enough. 

People's reluctance to switch 
from traditional ways or to use 
poor-quality water for tasks such 
as laundry could also undermine 
the success of the proposed plans. 
Yanhong Tang Northeast 
Agricultural University, Harbin, 
China. 

Xin Miao Harbin Institute of 
Technology, China. 
tangyanhong@aliyun.com 


When less means 
more on dairy farms 


Vanishing profit margins in dairy 
farming are intensifying pressure 
on the ubiquitous, genetically 


elite, high-yielding Holstein milk 
cow. A radical change in strategy 
is needed — less-intensive 
agriculture and a reduction in 
consumer wastage (see go.nature. 
com/bwichl) could more than 
compensate for lost production. 

The Holstein’s remarkable milk 
production (up to 10,000 litres 
a year) is associated with poor 
condition, fertility and survival 
(see P. Dillon et al. Livestock 
Sci. 99, 141-158; 2006), and 
has welfare implications. Its 
large disease burden demands 
routine hormone and antibiotic 
treatment, despite concerns 
about antimicrobial resistance. 
During a single lactation, one cow 
can eat more than its own body 
weight in cereals, much of which 
is potential human food and is 
grown using polluting artificial 
fertilizer. 

An alternative would be to use 
cattle breeds that are genetically 
more resilient, less disease- 
prone and that have male calves 
suitable for beef production 
(females are used as milking- 
herd replacements). Cereal 
supplementation of feed may be 
minimized by taking advantage 
of the ruminant’s ability to digest 
forage, green waste and fibrous 
by-products. Such a cow can still 
produce up to 8,000 litres of milk 
annually. 

As the Pareto principle, or 
‘80:20 rule; predicts, that would 
amount to around 80% of yield 
for just 20% of the environmental 
and welfare costs. 

Mark C. Eisler, Michael R. F. 
Lee University of Bristol, UK. 
Graeme B. Martin University 
of Western Australia, Crawley, 
Australia. 
mark.eisler@bristol.ac.uk 


Private collections of 
fossils are a plus 


Paul Barrett and Martin Munt 
contend that private collections 
of fossil specimens hold back 
science because they are not 
readily accessible (Nature 512, 
28; 2014), but this need not be the 
case. The solution lies in closer 


collaboration between private 
collectors and palaeontologists. 

Private collectors provide 
a valuable service: many 
scientifically important 
specimens would never 
have been found, collected 
or prepared without their 
enthusiasm and dedication. 
This applies to every specimen 
of Archaeopteryx (the earliest 
bird) discovered so far. Public 
institutions, by contrast, often 
do not have the funds or staff to 
carry out essential excavations, 
or to acquire scientifically 
important specimens, which are 
not routinely donated. 

Thanks to the cooperation of 
private collectors, such specimens 
can be well described and 
documented (using computed. 
tomography scanning, for 
example). This is preferable 
to overlooking scientifically 
important data, even if access is 
subsequently limited. 

We agree that specimens 
should be housed in conditions 
that allow verification of earlier 
observations. But this is not 
always the case for specimens 
held in public institutions: in our 
experience, access to material is 
sometimes denied, specimens 
may be lost or destroyed, or 
exhibited in such a way as to make 
detailed study difficult. 

Oliver W. M. Rauhut, Adriana 
Lopez-Arbarello, Gert 
Worheide Bavarian State 
Collection for Palaeontology and 
Geology; and Ludwig-Maximilians 
University of Munich, Germany. 
o.rauhut@lrz.uni-muenchen.de 


Japan to learn from 
biomedical cases 


The recent spate of high-profile 
retractions of biomedical 

papers by Japanese scientists is 
undermining the push by Japan's 
prime minister, Shinzo Abe, to 
strengthen medical research and 
innovation in the country (see 
go.nature.com/cznkrb). To stem 
this apparent proliferation of 
research misconduct, we suggest 
that Abe’s government needs to 


reform relations between Japan's 
pharmaceutical industry and its 
health and medical community. 
Several cases have involved 
medical schools that receive 
unrestricted funding from big 
Japanese drug companies. These 
include the Jikei Heart Study, in 
which data manipulation was 
implicated in the clinical trial of a 
blood-pressure drug (see Lancet 
382, 843; 2013); the questionable 
procurement of patient data for 
a leukaemia drug by company 
employees (see J. McCurry 
Lancet 383, 2111; 2014); and 
criticisms of data handling in 
a large government-supported 
study on Alzheimer's disease (see 
D. Normile Science 345, 17; 2014). 
There is also the infamous 
case of more than 100 papers 
by a Japanese anaesthesiologist 
that are now in the process of 
being retracted (see Nature 489, 
346-347; 2012). 
These worrying examples 
may be symptomatic ofa 
deeper malady. As well as being 
under pressure to publish, 
biomedical researchers in Japan 
are having to rely increasingly 
on large donations from the 
pharmaceutical industry as 
government funding shrinks. 
The prime minister’s 
‘Abenomics’ strategy for 
better health care requires a 
global, fair and competitive yet 
collaborative environment and 
partnership among stakeholders. 
It is therefore imperative that 
investigations into misconduct 
allegations are openly and 
formally conducted and that the 
lessons learned are used to limit 
further cases. 
Tetsuya Tanimoto, Masahiro 
Kami, Kenji Shibuya University 
of Tokyo, Japan. 
tetanimot@yahoo.co.jp 


CONTRIBUTIONS 
Correspondence 

may be submitted to 
correspondence@nature. 
com after consulting 

the guidelines at http:// 
go.nature.com/cmchno. 


28 AUGUST 2014 | VOL 512 | NATURE | 371 
© 2014 Macmillan Publishers Limited. All rights reserved 


OBITUARY 


Peter Marler 


(1928-2014) 


Pioneering interpreter of animal language. 


hite-crowned sparrows 
sing in distinct dialects 
in different parts of Cali- 


fornia. African vervet monkeys use 
various alarm calls to signal different 
dangers, such as snakes, mammalian 
predators or birds of prey. Peter Robert 
Marler chronicled these phenomena 
to establish ideas about how animals 
communicate. He hoped to find clues 
about the biology of human language. 
How does language acquisition blend 
innate knowledge and learning? 

Marler, who died on 5 July, first 
became interested in animal sounds 
as a doctoral student in botany at 
University College London. Survey- 
ing potential nature reserves in Scot- 
land, UK, he noticed that the song of 
the chaffinch Fringilla coelebs changed from 
valley to valley. This led to a second PhD, in 
zoology from the University of Cambridge, 
UK, in 1954, for which he described the 
complete vocal repertoire of this songbird, 
revealing that different chaffinch calls sig- 
nalled different dangers. The observation 
was unprecedented. 

Born in 1928 in Slough, near London, 
Marler left England in 1957 to join the fac- 
ulty of the University of California, Berkeley. 
It was the heyday of ethology, the study of the 
behaviour of animals in their natural settings. 
Pioneered in Europe by Konrad Lorenz, 
Nikolaas Tinbergen and Karl von Frisch, 
ethology differed from work of comparative 
psychologists in the United States such as 
B. F. Skinner, who preferred to study animals 
— mostly rats and pigeons — in simplified 
laboratory settings. Whereas the psycholo- 
gists sought universal rules of learning that 
applied to all animals, ethologists sought to 
understand animals’ adaptations. 

Marler, a field biologist, taught his 
students to imagine the challenges faced by 
wild animals, the resources and threats they 
encountered, the bonds they had to build, 
and the information they sought and con- 
veyed. He challenged us to identify what sig- 
nals they used in which contexts, and what 
responses they elicited. The song of a territo- 
rial chaffinch was a loud and ringing threat 
to rivals, but when the same male courted a 
female at close quarters, his voice morphed 
into a jumble of soft sounds reminiscent of 
the sweet talk of lovers. Notice these things, 
Marler taught us, but be careful with the 
inferences you draw. 


I was enthralled when I first heard Marler 


lecture at Berkeley. Here was a young Charles 
Darwin building a rational edifice to under- 
stand how animals communicate. The goal 
was to discover how much of this process 
was instinctual, how much was learned and 
how had it all evolved. 

Marler was superb at designing laboratory 
and field experiments. He used the latest 
techniques to record, analyse and play back 
sounds so that observers and instruments 
participated in ‘conversations with animals. 
After quantifying animals’ responses, Marler 
would write his report, and there emerged 
his signature style. Taken together, he would 
say, “the observations suggested” some inter- 
pretation. It was his way of inviting readers 
and colleagues to join him in his exploration. 
No ringing truth, no strident dogma — it 
left him ample room to lead with ideas and 
observations while keeping the intellectual 
framework flexible. 

Marler moved from Berkeley to Rockefel- 
ler University in New York in 1966, and in 
1972 became the first director of the univer- 
sity’s Center for Field Research in Ecology 
and Ethology in Millbrook, north of Man- 
hattan. In this unique facility, he fostered the 
integration of field and laboratory work. In 
1989, he moved his laboratory to the Univer- 
sity of California, Davis, where he worked as 
a professor until retiring in 1994. 

Marler encouraged his students to study 
any species that took their fancy: birds, flies, 
tarantulas, lizards, electric fish, bats, ceta- 
ceans, gerbils and wolves, among others. 
Students went to Kenya to study vervet mon- 
keys, to Borneo to study orang-utans, and 
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to Japan to study macaques. Marler 
himself spent time studying colobus 
monkeys in Uganda and then, with 
primatologist Jane Goodall, the social 
behaviour of chimpanzees in Tanza- 
nia. He had hoped to glean insights 
on human language. He learned much 
about the signals that chimpanzees 
use, but felt that human language 
remained in its own class. 

Marler astutely realized that if he 
shifted his focus from language to 
vocal learning, then birds had much 
to offer. Early on, he noticed that 
songbirds were picky about what 
sounds they imitated, and that choice 
often occurred during the first year 
of life. Moreover, the stages of vocal 
learning were set. This explained why 
the dialects of his chaffinches disappeared 
when the young birds were reared away from 
wild-type models. Moreover, the what, when 
and how of vocal learning were directed, as if 
by an innate schoolteacher. Marler suggested 
that this interlocking of nature and nurture 
acted as an ‘instinct to learn. 

This important insight was an alternative 
to the learning by trial and reward favoured 
by comparative psychologists. When it 
came to learning language, Marler felt that 
humans were more like his songbirds than 
like Skinner's rats — a conclusion that had 
repercussions. If language acquisition, so 
central to our manner of thinking, is ruled by 
biological predisposition, what does this tell 
us about the nature of human knowledge? 
Is it as idiosyncratic as the dialect of a chaf- 
finch? At a time when much of the buzz in 
biology was molecular, Marler was one of the 
few philosophers of nature. 

Peter was a gentleman. He adored his wife 
Judith and their three children. For many 
years, she ran the encampments where Peter 
and his students went to collect nesting song- 
birds, and she established home nurseries for 
hand-rearing the young. Peter and Judith 
were extraordinarily gracious hosts. They 
loved good food, good booze, good conver- 
sation and a good party. It was the good luck 
of Marler’s students to be part of that world. m 


Fernando Nottebohm is professor of 
animal behaviour at Rockefeller University 
in New York, USA. He earned his PhD under 
Peter Marler at the University of California, 
Berkeley, in the 1960s. 

e-mail: nottebo@mail.rockefeller.edu 
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Hiding in plain sight 


The latest releases from the ENCODE and modENCODE research consortia more than double the number of data sets on 
functional elements in the worm, fly and human genomes. SEE ARTICLES P.393, P.400 & LETTERS P.445, P.449, P.453 


FELIX MUERDTER & ALEXANDER STARK 


ne of the major scientific achievements 
() of our time has been the sequencing 

of the human genome and those of 
model organisms such as fruit flies and worms. 
These sequences encode species-specific 
information about protein-coding and non- 
coding genes and the regulatory information 
that determines when and where the genes are 
activated. However, even though this genomic 
information is present in the sequences, under- 
standing it, or even just comprehensively iden- 
tifying and annotating the different functional 
elements, is a major challenge. In an effort to 
identify all functional elements in the genomes 
of humans, Drosophila melanogaster flies and 
Caenorhabditis elegans worms, the Encyclo- 
pedia of DNA Elements (ENCODE) and the 
Model Organism ENCODE (modENCODE) 
research projects were launched’”. This issue 
of Nature contains five papers*’ that sum- 
marize the latest data from these consortia. 
Together, the publications add more than 
1,600 new data sets, bringing the total number 
of data sets from ENCODE and modENCODE 
to around 3,300 (Fig. 1). 

The potential impact of such data is 
undeniable. More-complete genome annota- 
tions will form the basis for improved genetic 
studies in D. melanogaster and C. elegans — 
organisms that have already contributed most 
to our understanding of animal development 
and the molecular mechanisms involved. It is 
also increasingly clear that gene-regulatory 
elements are crucial for development and are 
frequently linked to disease; comprehensive 
identification of these elements should, for 
example, allow the interpretation of disease- 
associated mutations in non-coding genomic 
regions. 

Two of the papers present data on RNA 
transcripts — Brown et al.’ (page 393) in 
Drosophila and Gerstein et al.* (page 445) 
in all three species. Brown and colleagues’ 
analysis of the Drosophila transcriptome, 
which they assessed in 29 tissues, 24 cell lines 
and 21 whole-animal samples that had been 
subjected to environmental perturbations, 
yielded more than 300,000 transcripts for 
17,564 genes, of which 14,692 were protein- 
coding (different transcripts from the same 
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Figure 1 | The growth of ENCODE and modENCODE data sets. The ENCODE and modENCODE 
research consortia aim to identify all functional elements in the human genome and the genomes of 

the model organisms Caenorhabditis elegans (worm) and Drosophila melanogaster (fruit fly). The latest 
release*’ from these projects focuses on three key data types: RNA-seq, which identifies RNA transcripts 
from cells or whole organisms; ChIP-seq for regulatory factors, which identifies locations in the genome 
that are bound by these proteins; and sequencing-based assays to profile various features of chromatin 
(the complex of DNA and histone proteins). The graph shows the total number of data sets now available 
for these data types, compared with previous releases” (note that the numbers for the previous worm 
and fly releases do not include some microarray-based data sets). 


gene are referred to as transcript isoforms). 
Of these genes, 57 (5,259 transcripts) were 
expressed only during perturbations and 
would thus probably escape identification 
under standard laboratory conditions. The 
analysis also identified many new candidate 
long non-coding RNAs, including ones that 
overlap with previously defined mutations 
that have been associated with developmental 
defects. Another intriguing finding was a small 
number of mostly neuronal genes that give 
rise to half of all detected transcript isoforms, 
reminiscent of the many transcripts known to 
be generated from the neuronal gene Dscam’. 
These data show that sampling selected tissues 
under non-standard conditions allows new 
genes and transcript isoforms to be identified 
even in well-studied organisms. 

Regulatory elements are more difficult to 
identify than transcripts. They are typically 
predicted on the basis of characteristic fea- 
tures of chromatin (the complex of histone 
proteins and DNA in the cell nucleus) and by 
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studying regulatory-protein binding to DNA’ 
— refining such predictions is a key aim of 
both the ENCODE and modENCODE pro- 
jects. Among the latest releases, Araya et al.” 
(page 400) report the genome-wide binding 
profiles for 92 regulatory proteins, includ- 
ing transcription factors, RNA-polymerase 
subunits and chromatin-associated factors, 
in whole embryos and larvae from different 
developmental stages in C. elegans. Although 
this approach may provide information on 
regulatory changes during development, it 
is limited by a lack of cellular resolution”: 
transcription factors typically associate with 
cell-type-specific partner proteins to bind 
to different sites and regulate distinct genes 
in different cell types. Therefore, targets that 
are bound in only a few cells could be missed 
in whole-organism studies, and those that 
are found may constitute a superposition of 
binding sites from different cells. The authors 
partly deconvoluted these by determining the 
expression patterns for 180 genes, including 


13 of the transcription factors profiled, in the 
early embryo at single-cell resolution. 

Araya and colleagues’ data also include 
binding profiles for predicted transcription 
factors that are otherwise uncharacterized. 
This will allow hypotheses to be generated 
about the proteins’ possible functions, par- 
ticularly, for example, if the binding sites are 
enriched near certain types of gene'!"”. 

Akey feature of this rollout of ENCODE and 
modENCODE data are comparisons across 
the three species studied. Complementing 
Araya and colleagues’ data in worms, Boyle 
et al.° (page 453) present almost 500 new 
genome-wide binding maps for transcrip- 
tion-regulatory factors in human cell lines, 
Drosophila and C. elegans. They found that 
about half of the binding events in each 
species occur at high-occupancy target (HOT) 
regions'*"*, where binding is heavily clustered. 
Although the function of these regions has not 
been assessed, our work in Drosophila’* sug- 
gests that many are active enhancers, which 
trigger gene transcription. However, because 
factors can bind DNA without functional 
consequences, especially at HOT regions, the 
contribution of each of the bound factors to 
enhancer activity remains unclear. 

Apart from the existence of HOT regions, 
Boyle and colleagues’ data reveal only a few 
commonalities between the species. But this 
is not unexpected — regulatory connections 
and target genes for individual transcription 
factors vary substantially between different cell 
types in a single species, so it is not surprising 
that there is little overlap in data derived from 
samples as disparate as human cell lines and 
whole fly and worm embryos. Thus, although 
the data sets may be valuable in each of the 
species, their usefulness for studying the evo- 
lution of gene regulation in cross-species com- 
parisons is questionable, because such studies 
should compare homologous cell types that 
have shared developmental and functional 
properties. 

Ho and colleagues’ comparisons’ (page 449) 
focused on chromatin features that charac- 
terize regulatory genomic elements, such as 
DNA accessibility and certain modifications 
to histone proteins. In 800 new chromatin data 
sets, they identified several features common 
to the three species, including shared histone- 
modification patterns around genes and regu- 
latory regions. Gerstein et al. integrated this 
information with transcription data to present 
a ‘universal modef for predicting gene expres- 
sion. As the authors point out, these common- 
alities are not surprising’ and are in agreement 
with the modifications’ known distributions in 
each of the three species and in yeast. Instead, 
Ho and colleagues focused on the observed 
differences, which predominantly concern 
chromatin regions that are repressive (gene 
transcription from such regions is suppressed). 

These five papers represent a substan- 
tial addition to the public ENCODE and 


modENCODE resources. We expect the 
transcriptome data sets to have a direct influ- 
ence on gene annotations in all three species, 
which should affect the work of many research- 
ers immediately'™”. Itis arguably more difficult 
for scientists to easily access the data on chro- 
matin features and regulatory-factor binding 
sites, and the regulatory-element predictions. 
This needs integration with the community 
portals’®”’ and intuitive interfaces that allow 
data visualization and flexible analyses, which 
are being developed by the UCSC Genome 
Browser project and Ensembl, the two con- 
sortia, and others (such as i-cisTarget’ or 
GREAT”). The success of the ENCODE and 
modENCODE resources depends on such 
interfaces being integrated into workflows 
throughout the research community. 
Furthermore, although they are extremely 
data-rich, the papers expose how data sets 
that are created to catalogue all functional 
elements under standardized conditions are 
not sufficient for understanding the regula- 
tion of transcription, chromatin biology and 
enhancer function, nor the evolution of these 
mechanisms. Addressing such questions 
typically requires more-diverse set-ups and 
experiments, often specifically adjusted for 
each question. In addition, the identification 
of regulatory elements remains limited” by 
the lack of cell-type specificity and the fact 
that chromatin features and regulatory-factor 
binding are imperfect predictors of regula- 
tory-element function’. The papers do not 
reveal how many of these elements might be 
functional, and independent estimates span a 
broad range”"®. However, the new data, in con- 
junction with the work of many other groups, 
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will undoubtedly aid future research into the 
identification, functional characterization and 
understanding of genes, regulatory elements 
and animal genomes more generally. m 
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Supernova seen 
through y-ray eyes 


Observations of y-ray photons from a type Ia supernova indicate that stellar 
explosions of this kind get their energy from sudden thermonuclear fusion in the 


progenitor star. SEE LETTER P.406 


ROBERT P. KIRSHNER 


n page 406 of this issue, Churazov 

et al.’ report a great discovery — not 

because it is a surprise, but precisely 
because it is not. The researchers have detected 
y-ray emission lines from the type Ia super- 
nova 2014J in the nearby galaxy M82 using 
the European Space Agency’s INTEGRAL 
spacecraft. For decades, astronomers have 
been working out the physical picture for 
this type of exploding star on the basis of 


the optical light it emits. The authors’ study 
confirms directly the most fundamental idea 
in that picture by observing a supernova in the 
y-ray range of the electromagnetic spectrum. 
The y-rays they observed in the months after 
the supernova explosion were produced, as 
expected, by the radioactive decay of isotopes 
fused in a thermonuclear flame that destroyed 
a compact star. 

Astronomers react quickly to explosive 
events. On the evening of 21 January 2014, 
while supervising an undergraduate astronomy 
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Figure 1 | Supernova SN 2014J in nearby galaxy M82. This image is a composite of optical images 
taken with the Hubble Space Telescope. Churazov et al.' have used the INTEGRAL spacecraft to observe 
y-rays from SN 2014]. 


laboratory at University College London, 
Steve Fossey and his students noticed an extra 
star in M82, later named supernova (SN) 2014J 
(Fig. 1). The star was detected by accident, but 
its discovery set off a flurry of observations to 
confirm that it was a supernova’, determine its 
type and gather the most telling bits of data. 
Objects that are near the Sun have the most to 
teach us, and SN 2014J is the nearest type Ia 
supernova explosion since 1972. 

There are two main kinds of supernova: 
those that get their energy from the gravita- 
tional collapse of their progenitor stars, and 
those whose energy comes from catastrophic 
thermonuclear fusion in the stars — the type Ia 
class to which SN 2014J belongs. These explo- 
sions take about 3 weeks to reach their peak of 
4 billion times the Sun’s light output. After a 
more-or-less symmetrical peak, they exhibit 
a long exponential decay. Their bright peak 
means that they can be seen halfway across the 
Universe. They have a narrow range in bright- 
ness, once corrections to their optical emission 
are applied, so they can be used as ‘standard 
candles’ to gauge cosmic distances with good 
precision. 

That important application led to the 2011 
Nobel Prize in Physics for the discovery that 
the expansion of the Universe is accelerat- 
ing’. However, the fundamental idea that 
these supernovae result from the sudden 
thermonuclear fusion of carbon in a white- 
dwarf star has not had a direct test — until 
now. Although it is permissible to use objects 
that we do not fully understand to learn 
about the cosmos, it is much better to under- 
stand them. Churazov and colleagues have 
gone to the heart of the matter by directly 


measuring the results of the nuclear fusion that 
destroyed the progenitor star of SN 2014] and 
made it glow. 

Towards the end of its life, a star with the 
mass of the Sun will shed its outer layers as its 
core shrinks down to become a white dwarf. A 
white-dwarf star with the mass of the Sun is the 
same size as Earth. White dwarfs are durable 
ifleft alone. With no nuclear energy input and 
a small emitting surface, single white dwarfs 
just cool off slowly over time, gradually fading. 
But they are brittle — if a white dwarf acquires 
mass from a stellar companion, or smashes 
into another white dwarf, that extra weight 
can compress the carbon in the star’s core until 
this element undergoes nuclear fusion. Fusion 
happens in a flash: a thermonuclear flame rips 
through the white dwarf, fusing carbon into 
heavier elements with a sudden release of 
energy that tears the star apart. Fusion stops 
yielding energy at the element that has the 
most tightly bound nucleus — in the case ofa 
white dwarf, nickel-56 (*°Ni). 

An exploding white dwarf should be full of 
°°Ni nuclei — the pressure produced by the 
nuclear flame shreds the star and blasts these 
nuclei out at thousands of kilometres per 
second. In the first days after the explosion, 
the rise to the optical peak is powered by the 
radioactive decay of °°Ni, which has a half-life 
of 6.1 days and decays into cobalt-56 (*°Co), 
which has a 77-day half-life. This isotope then 
decays into stable iron-56 (°Fe; ref. 4), help- 
ing to power the emission from the supernova. 
Type Ia supernovae that exploded 5 billion 
years ago are the cosmic sources of the iron in 
the Sun, Earth and your blood. 

This is the conventional wisdom for 
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this class of supernova™®. Answers to my 
introductory astronomy exams get marked 
‘correct’ if you write these things, but it is 
always better to look at the book of nature 
than at any textbook. The direct test of this 
canonical model would be to detect y-rays 
emitted from *°Co as it decays to “Fe in the 
weeks after the explosion. At first, most of the 
y-rays are absorbed in the rapidly expanding 
material of the shredded white dwarf. This 
energy deposit accounts for the continuing 
luminosity of a type Ia supernova. To match 
the observed energy output of a type Ia super- 
nova requires about 0.4-0.6 solar masses of 
°°Ni in the expanding debris. Eventually, as 
the star’s material thins out, some of the y-rays 
begin to escape from the explosion without 
depositing their energy, so if you have the 
right kind of detector, you have a chance to 
observe them’. 

Churazov and colleagues did not miss their 
chance. They observed SN 2014] between 
days 50 and 100 after the explosion using the 
INTEGRAL satellite and measured the y-ray 
emission. As they show in their Figure 3 (see 
page 407 of the paper’), there is a new spot 
of high-energy emission at the location of 
SN 2014J that was not present in 2013 (before 
the explosion occurred). Moreover, the y-ray 
energy spectrum of the photons derived from 
that spot is a good match to a model spectrum 
of the y-rays that escape from the exploding 
star (see Fig. 1 of the paper’). The observed 
amount of y-ray emission corresponds to 
about 0.6 solar masses of *°Ni at the source, 
right in the ballpark of white-dwarf explo- 
sion models. The data are noisy because this 
measurement is at the limit of present technol- 
ogy, so it is not wise to trust too many details, 
but the basic results are clear: type Ia super- 
novae really are thermonuclear events. 

Upsetting the conventional wisdom is 
always a joy in science. You can get prizes for 
that. But there is also a deep pleasure in show- 
ing decisive evidence on an important physi- 
cal idea that has been used without proof for 
decades. The INTEGRAL data establish the 
thermonuclear explosion mechanism for 
type Ia supernovae. It is a wonderful result. = 
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Mammalian watchdog 
targets bacteria 


The aryl hydrocarbon receptor elicits protection against toxic environmental 
molecules. New data show that the receptor also supports the immune system by 
recognizing bacterially encoded virulence factors. SEE ARTICLE P.387 


PARAG KUNDU & SVEN PETTERSSON 


ll animals, including humans, are 
Az daily to a variety of chemi- 

cals in the air, water and food. Some 
of these carry valuable information about the 
host’s environment, such as the presence of 
food, predators, members of the opposite sex 
or, in the case of hyenas, members of the clan 
to which they belong’. But others are toxic and 
must be eliminated. Among several mecha- 
nisms for detecting and responding to these 
environmental cues is the aryl hydrocarbon 
receptor (AhR) protein, which can facilitate 
the biotransformation and elimination of toxic 
compounds encountered in the environment. 
On page 387 of this issue, Moura-Alves et al? 
report that bacterial compounds known as 
phenazines also act as potential AhR ligands, 
and that recognition of these virulence factors 
by AhR contributes to host defence against 
invading microbial pathogens. 

AhR is widely expressed in the mammalian 
body and is bound by a broad range of ligands 
that are mostly aromatic and hydrophobic 
compounds of endogenous or synthetic ori- 
gin’. The unbound receptor is retained in an 
inactive form in the cellular cytoplasm but 
moves to the nucleus following ligand bind- 
ing. Once in the nucleus, AhR has several 
functions, including marking sex-steroid 
receptors for destruction (by ubiquitination)* 
and inducing the transcription of a battery 
of target genes involved in the regulation of 
cellular stress and metabolism’. 

AhR has also been implicated in cross- 
talk with the immune system, particularly in 
promoting the differentiation of Th17 cells*. 
This suggests that the receptor may have 
a broad range of functions in addition to 
clearing unwanted chemical substances. 
Because the neutralization of microbial 
infections is one of the key functions of the 
mammalian immune system, Moura-Alves 
and colleagues used a molecular-modelling 
approach to test whether AhR senses ligands 
of bacterial origin. They found that pigmented 
virulence factors from pulmonary pathogens 
such as Pseudomonas aeruginosa and Myco- 
bacterium tuberculosis can bind to the ligand- 
binding domain of the receptor. They also 
provide evidence that this new class of ligand, 


namely phenazines from P. aeruginosa and 
phthiocol from M. tuberculosis, activate AhR 
in a dose-dependent manner, leading to the 
elimination of these virulence factors, pos- 
sibly through an AhR-controlled metabolic 
circuit (Fig. 1). 

Phenazines and phthiocol are versatile 
secondary metabolites synthesized by bacteria 
and are known to influence bacterial inter- 
actions with their hosts. Most of these com- 
pounds have antibiotic properties and play a 
key part in regulating cellular redox states and 
the generation of reactive oxygen species, thus 
enhancing the virulence of their manufacturer’. 
In plants, phenazines influence growth by eli- 
citing ‘induced systemic resistance’ and protec- 
tion against plant pathogens’. Thus, it seems 
that phenazine-producing bacteria display an 
interesting species-specific dichotomy, acting 
as symbionts in plants and as pathogens in ani- 
mals. Although phenazine-producing bacteria 
are often soil or plant-dwellers, they are also 
found in the normal human microbiota®. For 
instance, Nocardia species are a part of our oral 
microbiota and reside in our healthy oral cavity, 
and Methanocercina mazei is a component of 
our gut microbiome’. 
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The idea that microbial metabolites acti- 
vate AhR is not new, as it has long been 
known that indoles, a group of AhR ligands, 
are generated by bacterial metabolism of the 
amino acid tryptophan’. Lactobacilli, found 
among our gut microbes, produce indole- 
3-aldehyde as a tryptophan metabolite, and 
this seems to act as an AhR ligand, promot- 
ing host resistance to fungal pathogens”. 
This suggests that phenazine-dependent acti- 
vation of AhR may execute other functions in 
host responses to bacteria in addition to the 
clearance of virulence factors. 

To validate their findings in vivo, Moura- 
Alves et al. studied mice lacking the gene 
encoding AhR, and found that infection with 
phenazine-producing P. aeruginosa induced 
more-aggressive disease characteristics and 
increased bacterial load compared to mice 
with AhR. They also identified two classes of 
cell — myeloid and parenchymal cells — as the 
major contributors to this AhR-mediated host 
defence. An additional twist to the story comes 
from their finding that phthiocol sensing by 
AhR, especially by myeloid cells, increases 
resistance to M. tuberculosis infection and pre- 
vents systemic dissemination of the bacterium 
in mice. 

These findings establish a direct dialogue 
between AhR functions and invading patho- 
genic microbes, thereby consolidating the 
concept that AhR is an integral part of mam- 
malian immunity. This new function of AhR is 
somewhat surprising, given previous demon- 
strations that AhR activation impairs immune 
responses to a variety of pathogens including 
the influenza virus" and herpesviruses”. More- 
over, because our normal, non-pathogenic 
microbiome contains phenazine-producing 
bacteria, there must exist a form of tolerance 
that allows maintenance of these populations 
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Figure 1 | AhR senses bacterial virulence factors and regulates host defence. Moura-Alves et al.’ 
report that the mammalian aryl hydrocarbon receptor (AhR) senses pigmented bacterial virulence 
factors, including phenazines produced by Pseudomonas aeruginosa and phthiocol from Mycobacterium 
tuberculosis. Binding of these bacterial metabolites to AhR induces the receptor’s movement to the 
nucleus, where it activates the transcription of genes for toxin-metabolizing enzymes such as CYP1A1 
and CYP1B1. The authors suggest that AhR-induced increased expression of these enzymes eventually 
leads to the degradation of the virulence factors and subsequent clearance of the pathogens through 


host-defence mechanisms. 
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in specific sites, possibly mediated by an AhR- 
dependent mechanism. Indeed, a recent report 
portrays a ‘disease-tolerance defence pathway’ 
controlled by AhR, in which AhR-dependent 
tolerance against lipopolysaccharide, a compo- 
nent of the bacterial cell wall, imparts protec- 
tion against pathogenic invasion”’. Collectively, 
these findings provide evidence for AhR’s role 
in mammalian host defence against phenazine- 
producing bacterial infections and unfold an 
exciting chapter in our understanding of AhR 
functions. 

The diverse collection of AhR ligands, 
including hazardous chemical substances, 
metabolites from tryptophan, dietary ligands 
in fruits and cruciferous vegetables, and 
phenazines, suggest that this elusive ‘Scarlet 
Pimpernel -like receptor harbours a complex 
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and diverse repertoire of functions that 
remain to be discovered. Moura-Alves and 
colleagues’ findings spark the fascinating idea 
of an evolutionarily developed AhR-micro- 
biome connection, through which microbial 
communities can modulate host functions to 
reinstate the ‘survival of the fittest’. = 


Parag Kundu and Sven Pettersson are in 

the Lee Kong Chian School of Medicine and 
the Singapore Centre on Environmental Life 
Sciences Engineering, Nanyang Technological 
University, Singapore 637551. S.P. is also at the 
National Cancer Center, Singapore General 
Hospital, Singapore and in the Department 

of Microbiology, Tumor and Cell Biology, 
Karolinska Institute, Stockholm, Sweden. 
e-mail: sven. pettersson@ki.se 


What makes 
the Sun shine 


Neutrinos produced in the nuclear reaction that triggers solar-energy generation 
have been detected. This milestone in the search for solar neutrinos required a 
deep underground detector of exceptional sensitivity. SEE ARTICLE P.383 


WICK HAXTON 


remarkable detector of solar neutrinos 
called Borexino has operated for 
the past seven years in Italy’s Gran 
Sasso Laboratory, shielded by more than a 
kilometre of rock from the cosmic rays that 
bombard Earth's surface. A prolonged effort 
has reduced background signals from radio- 
active elements present in the detector that 
would otherwise obscure the neutrino signal. 
On page 383 of this issue, the Borexino Col- 
laboration' reports how this background sup- 
pression has enabled direct detection of the 
low-energy neutrinos produced in the nuclear 
reaction that initiates solar-energy generation. 
Attempts to identify the source of solar 
energy have a long history”. In the nineteenth 
century, gravitational contraction was the 
only known mechanism that could plausibly 
account for the Sun’s luminosity and extended 
lifetime. Because this limited the Sun's age to 
about 30 million years, physicist Willliam 
Thomson (Lord Kelvin) concluded that Charles 
Darwin's geological estimate of 300 million 
years for the age of the Earth was incorrect. 
The beginning of the twentieth century 
marked the discoveries of the equivalence of 
mass and energy, embodied in Albert Einstein's 
famous equation E= mc’, and of radioactive 
decays in which a nucleus changes its charge 
through weak interactions. Following Francis 


Aston’s measurements in 1920 of the mass dif- 
ference between four protons and a helium-4 
(‘He) nucleus, Arthur Eddington proposed 
that the source of solar energy is the fusion of 
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four protons to form “He. Later, responding to 
criticism that the Sun is not sufficiently hot to 
sustain nuclear fusion, Eddington invited his 
critic to “go and find a hotter place”. 

This dispute was resolved by George 
Gamow, who showed that quantum tunnelling 
would allow two solar protons to approach one 
another within the range required for nuclear 
fusion to occur. The detailed reactions lead- 
ing to the synthesis of “He were then deduced’: 
the proton-proton (pp) chain (Fig. 1) in the 
case of small, slowly evolving stars such as the 
Sun, and the carbon-nitrogen (CN) cycle in 
more-massive, rapidly evolving stars. Steady 
nuclear-energy release in the solar core keeps 
the temperatures high, ionizing hydrogen (H) 
and “He and producing a plasma in which 
the electrons act as a gas. The Sun burns ina 
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Figure 1 | The pp chain. The proton-proton (pp) chain of reactions is the dominant mechanism by 
which four protons are fused in the Sun to produce “He. Its three cycles, labelled ppl, ppII and pplll, are 
each accompanied by a distinctive electron neutrino (v,). In cases in which two reactions compete, the 
relative percentages of the two branches are indicated (taken from the standard solar model”). p, Proton; 
°H, deuterium; e*, positron; e”, electron; y, y-ray; °“He, helium-3 and -4; “Be, beryllium-7; ’Li, lithium-7; 
8B, boron-8. The Borexino Collaboration’ reports the first measurement of the neutrinos associated with 
the p+p reaction, and earlier measured the neutrinos from the ‘Be and p +e +p reactions. 
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long-lasting hydrostatic equilibrium, with the 
effects of gravity counterbalanced by the elec- 
tron-gas pressure. The rate of energy genera- 
tion must compensate for the radiative losses 
from the Sun’s surface. The Sun has maintained 
this balance for 4.6 billion years by consuming 
about half the hydrogen fuel in its core. 

This description of the Sun also applies to 
about 90% of the stars in the Milky Way. How 
can our understanding of energy generation 
in such ‘main sequence’ stars be tested? Con- 
ventional observations are limited to the Sun’s 
surface: they do not directly probe the deep 
interior, where the temperatures of 10’ kelvins 
necessary for fusion are found. Yet just as the 
Sun’s surface shines in photons, its core shines 
in neutrinos. Each synthesized “He nucleus 
requires the conversion of two protons to two 
neutrons, with each neutron being accompa- 
nied by an electron-type neutrino. The neutri- 
nos emerge directly from the core and reach 
Earth in eight minutes, carrying in their flux 
and energy distribution a detailed account 
of the Sun’s fusion processes. The neutrino 
core-shine at Earth is intense (10"' neutrinos 
per square centimetre per second) but diffi- 
cult to detect, because neutrinos rarely interact 
with matter as they pass through it. For this 
reason, the bulk of these neutrinos have evaded 
direct detection — until now. 

All but 1% of solar “He synthesis takes place 
through the pp chain, which releases two elec- 
tron neutrinos and 26.73 million electronvolts 
of energy. The three cycles of this chain (ppI, 
ppll and pplIil) are each associated with a char- 
acteristic neutrino source. All three cycles 
begin with the fusion of two protons in the 
solar plasma to form deuterium (7H), through 
the ‘pp’ and ‘pep reactions (Fig. 1). The former 
accounts for 99.76% of deuterium synthesis, 
and thus determines the rate of solar-energy 
generation. 

In early experiments to detect neutrinos 
— the historic chlorine experiment’, Kamio- 
kande’ and GALLEX/SAGE” — the number 
of neutrinos recorded was about one-half to 
one-third of that predicted by theory, indi- 
cating that a basic flaw existed in our under- 
standing of either the Sun or the physics 
of neutrinos. This solar-neutrino problem 
motivated the building of a new generation 
of massive detectors, namely, the Sudbury 
Neutrino Observatory’, Super-Kamiokande’ 
and Borexino (Fig. 2). The Sudbury Neutrino 
Observatory and Super-Kamiokande, which 
detected the high-energy neutrinos produced 
in the B-decay of boron-8 (*B) in the pplII 
cycle (Fig. 1), traced the problem to new par- 
ticle physics: neutrinos have a mass and can 
change ‘flavour’ during their transit from the 
solar core to Earth. Two-thirds of solar °B elec- 
tron neutrinos oscillate into other types (muon 
and tau neutrinos) before reaching Earth. 

The ‘B neutrinos comprise about 0.01% of 
the total flux of neutrinos coming from the Sun. 
The Borexino detector was designed to detect 
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Figure 2 | The Borexino detector. A view inside the Borexino detector showing its nested nylon vessels, 
as the inner volume is being filled with 278 tonnes of liquid scintillator. Solar neutrinos are detected 
through their scattering off electrons in the scintillator: the scattered electrons emit scintillation light 
that is recorded in phototubes. The photo was taken from a camera mounted on the 14-metre-diameter 
stainless-steel sphere that surrounds the detector. The camera looks out between several of the cones (in 
the foreground, and thus seemingly enlarged) that help to direct the light into the phototubes. 


the remaining, lower-energy solar neutrinos 
through their scattering off electrons in liquid 
scintillator: the recoiling electrons emit light 
that is recorded in the detector. Measure- 
ments of the low-energy neutrinos produced 
in the pep reaction and the pplI cycle have 
been announced previously'®". Now, after 
an extended detector-purification campaign, 
the Borexino Collaboration has measured the 
pp neutrinos — the lowest-energy neutrino 
branch, accounting for 90% of the total flux’. 

This result provides an important test of 
how matter affects neutrino oscillations — 
oscillations occurring within the Sun dif- 
fer from those occurring in a vacuum. Such 
‘matter effects’ can be exploited to determine 
whether a given neutrino is heavier or lighter 
than another neutrino. Previous experiments*” 
fixed the relative masses of two of the three neu- 
trinos in this way. The theory used predicts” 
that oscillation probabilities will be lower for pp 
neutrinos than for higher-energy *B neutrinos: 
a larger fraction of the pp neutrinos will arrive 
at Earth as electron neutrinos. The fraction 
found by Borexino, 0.64 + 0.12, is nearly twice 
that found for *B neutrinos. This verification 
of theory is important because future planned 
‘long-baseline’ neutrino-beam experiments will 
exploit matter effects to determine the ordering 
among all three neutrino masses. 

The Borexino results also provide new 
tests of the Sun. Most solar-neutrino analyses 
assume that the total flux of neutrinos is con- 
sistent with the solar luminosity. But the con- 
nection between neutrino emission, which 
measures the rate of energy generation in the 
solar core today, and luminosity is valid only 
in a steady-state Sun: because photons take 


about 100,000 years to diffuse out of the core, 
this connection would not hold if the tempera- 
ture of the solar core varies on times less than 
100,000 years. Tests of this connection thus con- 
strain solar variability and well as certain new- 
physics phenomena, such as solar emission of 
undetected ‘sterile’ neutrinos. Because the solar 
luminosity has been measured to a precision of 
0.01%, checks on this relationship are limited 
by neutrino-flux uncertainties. The Borexino 
Collaboration notes that the 10% uncertainty 
of its pp neutrino-flux determination could be 
reduced to 1% in an improved experiment. 
Now thatall four principal neutrino sources 
from the pp chain have been directly meas- 
ured, one task remains: about 1% of solar “He 
synthesis takes place through the CN cycle. 
Neutrinos produced through this channel 
have not yet been detected. Because the CN 
cycle is catalysed by reactions on C andN, its 
rate is proportional to the solar core’s metal- 
licity (the fraction of elements other than H 
or “He). Consequently, the core metallicity 
can be deduced from the CN neutrino flux. A 
measurement of these neutrinos could directly 
confirm the solar-abundance problem”: helio- 
seismic data suggest that the solar interior is 
metal-rich relative to the solar surface, contra- 
dicting the standard-solar-model assumption 
of a homogeneous primordial Sun, formed 
from gas that had mixed convectively. 
Measuring CN neutrinos is a final chal- 
lenge for Borexino, requiring further progress 
in background suppression. One of the can- 
didate explanations” for the solar-abundance 
problem is the effect of planetary formation on 
the early Sun: this process swept a great deal 
of metal from gas that, if then deposited on 
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the Sun’s surface, could have diluted the star’s 
outer convective zone. The demonstration of 
a connection between a host star’s metallicity 
and the presence of its planets would have sig- 
nificant implications for exoplanet searches.m™ 
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Hybrid sensors 
ring the changes 


An improved design for a class of magnetometer greatly increases the sensitivity 
of these devices — and might be the vanguard of a new generation of hybrid 
sensors that combine different types of signal to increase sensitivity. 


JORG WRACHTRUP & AMIT FINKLER 


hen seventeenth-century sailors 

undertook their dangerous jour- 

neys around the Cape of Good 
Hope or to the Spice Islands, they had an inval- 
uable navigational tool on board: the compass. 
Only with the aid of this, the most precise 
measurement device of those times, were they 
able to accomplish their daring feats. Since 
then, precision measurements of magnetic 
fields have been key drivers of basic science and 
of a whole wealth of technologies with appli- 
cations ranging from navigation to the medi- 
cal sciences. Writing in Advanced Materials, 
Forstner et al.' report a vastly improved design 
for a certain class of magnetometer — a hybrid 
sensor that measures magnetic fields using an 
optical signal. 

Various technologies have been developed 
to measure magnetic fields. The record hold- 
ers for sensitivity are devices known as atom- 
vapour magnetometers’. Others can measure 
the dipolar magnetic interaction between 
two electrons’, and still others, most notably 
diamond spin sensors, have a compact design 
that might allow magnetometers to be made 
as small as one nanometre across. The central 
element of Forstner and colleagues’ sensor is a 
‘whispering gallery mode’ micro-resonator: a 
doughnut-shaped device that traps light waves 
and is known for its strong optical confinement 
and ultra-high quality factor (a measure of the 
energy lost from the device; high quality factors 
correspond to low rates of energy loss). The fre- 
quency at which light resonates in the dough- 
nut depends on the doughnut’s geometry. 

To sense magnetic fields, Forstner et al. 
combined the resonator with a magnetostric- 
tive material. This material expands in a mag- 
netic field and thereby exerts a force on the 
cavity system, changing the micro-resonator’s 


shape and thus its resonance frequencies. So, 
by measuring the change in resonance fre- 
quencies, the strength of the magnetic field 
can be determined. 

The authors placed the magnetostrictive 
material in the middle of the doughnut, where 
it does not interfere with optical measurements 
and is most effective in changing the shape of 
the resonator (Fig. 1). Because of the high qual- 
ity factor of the device, tiny distortions of its 
shape — and therefore tiny changes of mag- 
netic field — can be detected. In a proof-of- 
principle experiment, Forstner and co-workers 
demonstrated that their device has a magnetic- 
field sensitivity of 100 picotesla (1 picotesla is 
10°” tesla) for measurements of 1 second dura- 
tion. Other kinds of magnetometer, such as 
atom-vapour types, are sensitive to much lower 
fields*° (less than 1 femtotesla; 1 femtotesla 
is 10°’ tesla), but Forstner and co-workers’ 
approach is three orders of magnitude more 
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sensitive than previous hybrid optomechanical 
sensors’, mostly because of the integration of 
the magnetostrictive material into the micro- 
resonator. Moreover, the researchers extended 
the frequency range of magnetic fields that can 
be sensitively detected by the sensor by using 
the nonlinear properties of the magnetostric- 
tive material. 

Compared with other, more-sensitive sensor 
types, the main advantage of Forstner and col- 
leagues’ device is its small size, which is limited 
to the size of the doughnut (about a few tens of 
micrometres). Furthermore, the device works 
at room temperature and tolerates background 
magnetic fields. This opens up various poten- 
tial applications, such as navigation without 
the need for a global positioning system, and 
the detection of weak biomagnetic fields from 
neurons. 

Forstner and co-workers’ achievements 
follow a recent trend in sensor design: the use 
of optomechanical sensors, in which opti- 
cal and mechanical signals are combined to 
enhance sensitivity. By incorporating advances 
in nanophotonics and the fabrication of 
nanoscale structures, optomechanical sen- 
sors have been made that perform better than 
their electromechanical counterparts, mostly 
because the noise in optical readouts is much 
lower than in electronic ones. Excellent exam- 
ples include state-of-the-art force sensors’ and 
accelerometers’, and a detector of microwave 
photons’. 

Hybrid designs based on principles other 
than optomechanics might also be adopted to 
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Light 


Figure 1 | An optomechanical magnetometer. Forstner et al.' have constructed a magnetic-field 

sensor in which a doughnut-shaped ‘whispering gallery mode resonator device surrounds a piece of 
magnetostrictive material. Light is guided through a tapered optical fibre into the sensor, where it is trapped 
by the resonator. In the presence of a magnetic field, the magnetostrictive material expands, distorting 

the resonator and changing the frequency at which light resonates in the device. The change of resonance 
frequency is measured by a spectrum analyser and used to determine the strength of the magnetic field. 
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improve the sensitivity of sensors. Consider 
diamond spin sensors. Rather than using 
optical resonances, diamond spin sensing 
relies on the precision measurement of tran- 
sitions between resonance states that are 
associated with a quantum property of elec- 
trons: spin. Such sensors can be responsive to 
various quantities, including magnetic’® and 
electric fields", temperature”, pressure” and 
strain’. But the sensitivity of these devices — 
particularly for temperature and strain — is 
limited by the nature of the ground-state spin 
wave function of the electrons used in the 
sensing mechanism”. 

It has been proposed” that this problem 
could be overcome by constructing a hybrid 
sensor using the same magnetostrictive mater- 
ial as that used by Forstner and colleagues. 
However, the material would function in 
exactly the opposite way to that reported by 
these authors: a strain (or an electric field) 
would generate a magnetic field, which is sen- 
sitively detected by the electron spins, which in 
turn are read out optically. The resulting dia- 
mond hybrid sensor is predicted to be about 
1,000 times more sensitive for pressure, force 
or electric field than diamond spins alone, and 
would retain excellent spatial resolution. 

Hybrid sensors thus seem to be an upcoming 
theme in sensor technology. An important fur- 
ther step will be to use advances in quantum 
technology to achieve the limits of accuracy. 


BEHAVIOURAL ECOLOGY 


The resulting quantum hybrid sensors could 
potentially revolutionize sensor technology 
in various disciplines, enabling unprece- 
dented opportunities in technology and basic 
science. m 
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Love thy neighbour 


A theoretical model suggests that the cause of female-driven extra-pair mating 
lies in the spreading of male interests among neighbouring families, creating 
powerful incentives for male cooperation and concomitant benefits for females. 


BEN C. SHELDON & MARC MANGEL 


irds have long served as paragons of 
B contented coupledom, and with apparent 

justification, given that more than 90% 
of bird species breed in socially monogamous 
pairs, with the male and female seemingly 
cooperating to rear offspring. But appear- 
ances can be deceptive. In fact, in the majority 
of bird species tested — data are available for 
more than 200 species — a substantial propor- 
tion (sometimes exceeding 50%) of offspring 
are sired by a male other than that providing 
parental care. A huge, but rather fruitless}, 
effort to explain the evolution of female extra- 
pair mating behaviour in birds has focused on 
potential genetic benefits to females. Now, writ- 
ing in PLoS ONE, Eliassen and Jorgensen’ offer 
an alternative explanation: that extra-pair mat- 
ing by females creates an incentive for males to 
cooperate with their neighbours, generating a 


form of collective good that may be a powerful 
evolutionary force. 

The development in the 1980s of genetic 
markers to assign parentage revolutionized 
our understanding of mating patterns in many 
organisms, but nowhere was the effect greater 
than in birds’. Before this, most birds seemed 
rather uninteresting from the perspective of 
sexual selection, because it was expected that 
social monogamy, reinforced by the need for 
rapidly growing offspring to receive extensive 
care from both parents, would limit the extent 
to which sexual selection could influence fit- 
ness (in evolutionary terms, the likelihood 
of survival and reproductive success). The 
realization that extra-pair paternity could, in 
principle, greatly increase the fitness variation 
among males relative to females, hence creat- 
ing the potential for strong sexual selection, 
led to a vigorous interest in testing the causes 
and consequences of extra-pair copulation 
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50 Years Ago 


‘Obituary: Prof. James Franck’ — 
I remember his famous lecture 

... which gave strong support to 
Bohr’s new theory of the atom. 
Einstein said to me: “It’s so lovely, 
it makes youcry!” ... In 1920 
Franck was called to a chair in 
Gottingen ... many honours ... 
have come his way, among them 
the Nobel Prize in Physics in 1926 
jointly with Hertz ... Gottingen 
saw Franck’s happiest and most 
fruitful period ... Those happy 
days ended when Hitler came to 
power. When the racial laws were 
announced, Franck published a 
courageous open letter in which 
he resigned his chair ... He was 
the most lovable of men because 
he loved people; kindness shone 
from his eyes. There must be many 
beside myself who now feel they 
have lost one of their best friends. 
Lise Mietner 

From Nature 29 August 1964 


100 Years Ago 


The Trail of the Sandhill Stag by 

E. T. Seton; Wild Game in 
Zambezia by R. C. F. Maugham 
— The first of the books before us 
aims rather at being a work of art 
than of natural history ... Never 
since the days of melancholy 
Jacques was such an outpour of 
sentiment upon a stricken deer. 
To those who like this mood the 
book may be recommended, for it 
is curiously wrought and daintily 
embellished ... The second book 
on the list strikes quite another 
strain ... But the list of stores 
required by two persons for a 

trip of two months is startling. It 
almost shakes one’s confidence in 
the author to learn that he cannot 
go into camp with a friend for two 
months without a dozen tins each 
of lobster and salmon, two dozen 
tins of sausages, and three dozen 
tins of fruit in syrup. 

From Nature 27 August 1914 
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Figure 1 | Potential benefits of extra-pair mating. Genetic analysis of parentage in socially 
monogamous birds has revealed many cases in which female-driven extra-pair mating leads to males 
caring for offspring that are not their own. a, Previous studies to explain such behaviour focused on the 
idea that females gain genetic benefits from this behaviour. But the genetic benefit is small (or absent), 
because it is gained only from extra-pair copulations that result in fertilization. b, A new model by 
Eliassen and Jorgensen’ argues that the key force in the evolution of extra-pair mating in birds lies in the 
way that the matings distribute potential paternity among neighbours and therefore create incentives for 


cooperation among neighbouring males. 


(EPC) behaviour and resulting extra-pair 
fertilizations. 

EPC behaviour in birds is frequently initiated 
by females, and the only obvious benefit associ- 
ated with extra-pair fertilizations seemed to be 
genetic — the provision of genes from the extra- 
pair partners to offspring. So this seemed an 
ideal system for testing the ‘good gene’ benefits 
of sexual selection (Fig. 1a). Over time, however, 
a large body of empirical work, exemplified by 
recent comprehensive studies on American 
song sparrows (Melospiza melodia)’, led to the 
conclusion that the genetic benefits of EPCs are 
at most small. This sparked the suggestion’ that 
there has been too much emphasis on adaptive 
explanations for the evolution of female EPC 
behaviour, and that non-adaptive explana- 
tions, such as inter-sexual genetic constraints 
(female behaviour resulting from selection in 
males), should be more rigorously explored. 
The hypotheses arising from such alternative 
explanations are difficult to test empirically, 
although one study” of captive zebra finches 
was supportive of genetic constraints. 

Now, however, Eliassen and Jorgensen use 
evolutionary modelling to suggest that EPC 
behaviour in females is indeed adaptive, but 
that researchers interested in explaining it have 
simply been looking in the wrong place. The 
authors’ model (Fig. 1b) argues that female 
EPC behaviour creates an incentive for males 
to cooperate with neighbours in a wide range 
of behaviours that add together to a common 
good. The incentive occurs because, rather 
than all of a male’s expected paternity being 


limited to a single nest, female EPC spreads the 
likelihood of paternity among neighbouring 
nests. Given this uncertain redistribution ofa 
male’s reproductive interests, Eliassen and Jor- 
gensen predict an associated shift by males to 
invest more in behaviours that benefit neigh- 
bouring nests as well. 

What form might such behaviours take? 
In the context of socially monogamous birds, 
collective behaviour to discourage predators 
or infanticide by males represents one poten- 
tially important mechanism. Equally, this logic 
might operate through a reduction in territo- 
rial or aggressive interactions between neigh- 
bours, or by resource sharing during offspring 
rearing, such as through a relaxation in territo- 
rial boundaries. A strength of the new model is 
that the focus on what type of benefit females 
gain from extra-pair mating changes from 
genetic to ecological. Predation and food limi- 
tation are commonly identified as contributing 
to variation in fitness in wild populations, and 
the effects can be large. 

One of the puzzles in understanding EPC 
behaviour in socially monogamous species is 
that it occurs in systems that seem likely to be 
destabilized by such behaviour. Indeed, there 
is mounting comparative evidence, from social 
insects® as well as birds’, that cooperation is 
unlikely to evolve within extended families 
unless there is also genetic monogamy. The 
perspective offered by Eliassen and Jorgensen 
is that a different form of cooperation — 
between unrelated individuals — may actu- 
ally be promoted by promiscuity. However, 
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there will inevitably be a trade-off between the 
group-defence benefits accorded by redistribu- 
tion of paternity and the expected reduction in 
a male’s care for offspring in his ‘own’ nest® as 
his relatedness to these offspring falls; the reso- 
lution of this trade-off will probably depend 
on the relative costs and benefits of the two 
processes. Thus, their model is as much about 
the evolution of cooperation as it is about EPC 
behaviour. Furthermore, because the authors 
used an adaptive-dynamics method’ to build 
their model, they have laid the groundwork for 
investigating the broader goal of understand- 
ing the emergence of societies as complex 
adaptive systems”. 

Eliassen and Jorgensen show that many of 
their model’s predictions, when combined with 
life-history differences between species, accord 
well with empirical data from field studies. For 
instance, the model predicts that extra-pair 
sires should usually be immediate neighbours, 
and that exchange of paternity between males 
will be commonplace — both patterns are 
commonly seen in the field. Across species, 
the model explains why extra-pair paternity 
will be less frequent as longevity increases, 
a non-intuitive result that accords well with 
the evidence, but is also predicted by other 
models. At present, no data exist that could 
test this model to the exclusion of others, 
although the model may, as good theory 
should, challenge empiricists to design more 
discriminatory tests. Nevertheless, Elias- 
sen and Jorgensen’s work represents a radical 
shift in focus of tests of the costs and benefits 
of extra-pair mating in birds, and has broad 
consequences for our understanding of the 
evolution of cooperation among interacting, 
but non-related, individuals. = 


Ben C. Sheldon is at the Edward Grey 
Institute, Department of Zoology, University of 
Oxford, Oxford OX1 3PS, UK. Marc Mangel 
is at the Center for Stock Assessment Research, 
University of California, Santa Cruz, 

Santa Cruz, California 95064, USA. 

e-mails: ben.sheldon@zoo.ox.ac.uk; 
msmangel@ucsc.edu 


1. Forstmeier, W., Nakagawa, S., Griffith, S.C. & 
Kempenaers, B. Trends Ecol. Evol. 29, 456-464 
(2014). 

2. Eliassen, S. & Jorgensen, C. PLoS ONE 9, e99878 
(2014). 

3. Griffith, S. C., Owens, |. P. F. & Thuman, K. A. Mol. 
Ecol. 11, 2195-2212 (2002). 

4. Sardell, R. J., Arcese, P, Keller, L. F. & Reid, J. M. 
Am. Nat. 179, 779-793 (2012). 

5. Forstmeier, W., Marin, K., Bolund, E., Schielzeth, H. 
& Kempaenaers, B. Proc. Nat! Acad. Sci. USA 108, 
10608-10613 (2011). 

6. Hughes, W. O. H., Oldroyd, B. P., Beekman, M. & 
Ratnieks, F. L. W. Science 320, 1213-1216 (2008). 

7. Cornwallis, C. K., West, S. A., Davis, K. E. & Griffin, A. S. 
Nature 466, 969-972 (2010). 

8. Sheldon, B.C. Phil. Trans. R. Soc. Lond. B 357, 
341-350 (2002). 

9. Dercole, F. & Rinaldi, S. Analysis of Evolutionary 
Processes (Princeton Univ. Press, 2008). 

10.Levin, S. Proc. Natl Acad. Sci. USA 111 (Suppl. 3), 
10781-10784 (2014). 


ARTICLE 


doi:10.1038/nature13702 


Neutrinos from the primary 
proton-proton fusion process in the Sun 


Borexino Collaboration* 


In the core of the Sun, energy is released through sequences of nuclear reactions that convert hydrogen into helium. The 
primary reaction is thought to be the fusion of two protons with the emission of a low-energy neutrino. These so-called 
pp neutrinos constitute nearly the entirety of the solar neutrino flux, vastly outnumbering those emitted in the reactions 
that follow. Although solar neutrinos from secondary processes have been observed, proving the nuclear origin of the 
Sun’s energy and contributing to the discovery of neutrino oscillations, those from proton-proton fusion have hitherto 
eluded direct detection. Here we report spectral observations of pp neutrinos, demonstrating that about 99 per cent of 
the power of the Sun, 3.84 x 10°? ergs per second, is generated by the proton-proton fusion process. 


We have known for 75 years that the energy generated by stars comes 
from the fusion of light nuclei into heavier ones’. In the Sun, hydrogen 
is transformed into helium predominantly via the pp cycle*’, a chain of 
reactions releasing 26.73 MeV and electron neutrinos v,, and summar- 
ized as 


4p — *He + 2e* + 2, 


The cycle begins with the fusion of two protons into a deuteron, which 
occurs 99.76% of the time® by means of the primary reaction 


ptp—>7*H+e* +, 


Neutrinos produced in this step are referred to as pp neutrinos. Some 
of the nuclear reactions that follow also produce neutrinos of various 
energies. *He may also be formed through the CNO (carbon-nitrogen- 
oxygen) cycle’, which is thought to be predominant in heavy stars, but 
to produce at most 1% of the Sun’s energy’*. Present models of the Sun® 
precisely predict the flux and energy distribution of emitted neutrinos 
(Fig. 1). So far, only the radiochemical gallium experiments (after the first 
observation by GALLEX'"™" and, later, by SAGE”) have been sensitive to 
pp solar neutrinos (0 < E< 420 keV). However, by measuring only an 
integrated flux of all solar electron neutrinos above an energy threshold 
(233 keV), the pp neutrino flux could be extracted only indirectly, by 
combining the GALLEX and SAGE measurements with those of other 
experiments'*!”. 

The Borexino experiment came online in 2007 with high sensitivity 
to all solar neutrino components, particularly those below 2 MeV (Fig. 1). 
Borexino has made the first measurement of Be neutrinos!’ and proton- 
electron-proton (pep) neutrinos'®, measured *B neutrinos’ at a lower en- 
ergy threshold than other experiments, and set the best available limit on 
the solar CNO neutrino component”. The detection of pep neutrinos 
itself indirectly indicates the existence of pp neutrinos, because the p 
+e+tp—> ?H + v, reaction is a rare (0.24%; ref. 6) alternative first step 
of the pp cycle. Attempts to measure pp neutrinos directly over the past 
30 years (see ref. 20 for a recent review) have been hindered by the in- 
ability to sufficiently suppress radioactive backgrounds in this low-energy 
region. The Borexino detector, which is designed to minimize back- 
grounds from radioactive isotopes both within, and external to, the liquid 
scintillator target, made it possible to search for the very low-energy pp 


neutrinos. The measured solar pp neutrino flux is (6.6 + 0.7) x 
10'°cm~’s ', in good agreement with the prediction of the standard 
solar model? (SSM) (5.98 X (1 + 0.006) X 10'!°cm7s~ 1). 

The observation of pp neutrinos provides us with a direct glimpse at 
the keystone fusion process that keeps the Sun shining and strongly re- 
inforces our theories on the origin of almost the entirety of the Sun’s en- 
ergy. Their measured flux can also be used to infer the total energy radiated 
by the Sun, 3.84 x 107° ergs 1 However, because photons produced in 
the Sun’s core take a very long time (at least a hundred thousand years; 
ref. 21) to reach the surface, neutrino and optical observations in com- 
bination provide experimental confirmation that the Sun has been in 
thermodynamic equilibrium over such a timescale. 


Searching for pp neutrinos with Borexino 

The Borexino experiment (Methods) detects solar neutrinos by mea- 
suring the energy deposited in the liquid scintillator target by recoiling 
electrons undergoing neutrino-electron elastic scattering: 


Vy tev, +e (1) 


where x denotes one of the three neutrino flavours (e, 1, t). The 
detector is fully described in ref. 22. 

The solar neutrino flux reaching the Earth is composed not only of 
electron neutrinos produced in the nuclear reactions in the Sun, but, 
owing to the process of flavour oscillations (Methods), also of muon 
and tau neutrinos. The pp neutrino energy spectrum extends up to 
420 keV, yielding a maximum electron recoil energy of Emax = 264 keV 
(ref. 23). The expected flux of pp neutrinos is calculated in the frame- 
work of the SSM (Methods). The most recent calculations are those 
of ref. 9 (other models, such as the one described in ref. 24, give similar 
results). The predictions for the total flux of pp neutrinos at Earth’ range 
between 5.98 X (1 + 0.006) X 10'°cm™’s_', for the high-metallicity 
model, and 6.03 X (1 + 0.006) X 10'°cm 7s +, for the low-metallicity 
model. The latest values of the neutrino oscillation parameters” are 
needed to calculate the relative proportions of the three flavours within 
the solar neutrino flux at Earth. For reference, when combining them 
with the high-metallicity SSM prediction (assumed throughout this 
paper unless otherwise specified) and using neutrino-electron scatter- 
ing cross-sections derived from refs 25, 26 (also P. Langacker and J. 
Erler, personal communication), we expect the pp neutrino interaction 
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2 + 
p+ip =H +et'+ 4, 


Solar neutrino flux 


Neutrino energy (MeV) 


Figure 1 | Solar neutrino energy spectrum. The flux (vertical scale) is given in 
cm *s ‘MeV! for continuum sources and in cm *s’ ' for mono-energetic 
ones. The quoted uncertainties are from the SSM’. 


(equation (1)) rate to be 131 + 2 counts per day (c.p.d.) per 100t of 
target scintillator. 

The scintillation light generated by a 100 keV event typically induces 
signals in ~50 photomultiplier tubes (PMTs). This allows for a low de- 
tection threshold (~50 keV), much less than the maximum electron recoil 
energy of pp neutrinos (Eyax = 264 keV). 

The pp neutrino analysis is performed through a fit of the energy dis- 
tribution of events selected to maximize the signal-to-background ratio. 
The selection criteria (Methods) remove residual cosmic muons, decays of 
muon-produced isotopes, and electronic noise events. Furthermore, to 
suppress background radiation from external detector components, only 
events whose position is reconstructed inside the central detector volume 
(the ‘fiducial volume’: 86 m*, 75.5 t) are used in the analysis. The fit is done 
within a chosen energy interval and includes all relevant solar neutrino 
components and those from various backgrounds, mostly from resid- 
ual radioactivity traces dissolved in the scintillator. 

Figure 2 shows a calculation of the spectral shape of the pp neutrino 
signal (thick red line), as well as of the other solar neutrino components 
(Be, pep and CNO), and of the relevant backgrounds (WG intrinsic to 
the organic liquid scintillator; its ‘pile-up’ (see definition below); 210B;. 
20D; Kr: and? MDb), all approximately at the observed rates in the data. 
The pp neutrino spectral component is clearly distinguished from those 
of ®°Kr, 7!°Bi, CNO and Be, all of which have flat spectral shapes in the 
energy region of the fit. Most of the pp neutrino events are buried 
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Figure 2 | Energy spectra for all the solar neutrino and radioactive 
background components. All components are obtained from analytical 
expressions, validated by Monte Carlo simulations, with the exception of the 
synthetic pile-up, which is constructed from data (see text for details). 
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under the vastly more abundant MC whichisa B-emitter with a Q value 
of 156 keV. In spite of its tiny isotopic fraction in the Borexino scintil- 
lator (‘*C/'7C ~ 2.7 x 10 '*), 4C B-decay is responsible for most of the 
detector triggering rate (~30 countss ' at our chosen trigger thresh- 
old). The '““C and pp neutrino energy spectra are, however, distinguish- 
able in the energy interval of interest. 

The '*C rate was determined independently from the main analysis, 
by looking at a sample of data in which the event causing the trigger is 
followed by a second event within the acquisition time window of 16 1s. 
This second event, which is predominantly due to ‘C, does not suffer 
from hardware trigger-threshold effects and can thus be used to study 
the rate and the spectral shape of this contaminant. We measure a ‘“C 
rate of 40 + 1 Bq per 100t. The error accounts for systematic effects due 
to detector response stability in time, uncertainty in the '*C spectral 
shape”, and fit conditions (Methods). 

An important consideration in this analysis were the pile-up events: 
occurrences of two uncorrelated events so closely in time that they can- 
not be separated and are measured as a single event. Figure 2 shows the 
expected pile-up spectral shape, which is similar to that of the pp neutrinos. 
Fortunately, the pile-up component can be determined independently, 
using a data-driven method, which we call ‘synthetic pile-up’ (Methods). 
This method provides the spectral shape and the rate of the pile-up com- 
ponent, and is constructed as follows. Real triggered events without any 
selection cuts are artificially overlapped with random data samples. The 
combined synthetic events are selected and reconstructed using the same 
procedure applied to the regular data. Thus, some systematic effects, such 
as the position reconstruction of pile-up events, are automatically taken 
into account. The synthetic pile-up is mainly due to the overlap of two '*C 
events, but includes all possible event combinations, for example MC with 
the external background, PMT dark noise or *!°Po. '*C-'4C events dom- 
inate the synthetic pile-up spectrum between approximately 160 and 
265 keV. The fit to the ‘“C-"*C pile-up analytical shape in this energy 
region gives a total rate for '“C-'“C pile-up events of 154 + 10 cp.d. per 
100t in the whole spectrum, without threshold. 


Measurement of the pp neutrino flux 


The data used for this analysis were acquired from January 2012 to 
May 2013 (408 days of data; Borexino Phase 2). This is the purest data 
set available, and was obtained after an extensive purification campaign 
that was performed in 2010 and 201178 and reduced, in particular, the 
content of *°Kr and 7!°Bi isotopes, which are important backgrounds 
in the low-energy region. 

The pp neutrino rate has been extracted by fitting the measured 
energy spectrum of the selected events in the 165-590 keV energy win- 
dow with the expected spectra of the signal and background components. 
The energy scale in units of kiloelectronvolts is determined from the 
number of struck PMTs, using a combination of calibration data col- 
lected with radioactive sources deployed inside the scintillator” and a 
detailed Monte Carlo model”*. 

The fit is done with a software tool developed for previous Borexino 
measurements” and improved for this analysis to include the descrip- 
tion of the response of the scintillator to mono-energetic electrons, to 
give high statistics; a modified description of the scintillation line- 
width at low energy, providing the appropriate response functions 
widths for o- and B-particles (mainly from the *’°Po and '“C back- 
grounds); and the introduction of the synthetic pile-up. 

The main components of the fit are the solar neutrino signal (the 
dominant pp component and the low-energy parts of the Be, pep and 
CNO components); the dominant '*C background and the associated 

ile-up; and other identified radioactive backgrounds (Kr, 210; 

*°Po and *'“Pb). The free fit parameters are the rates of the pp solar 
neutrinos and of the ®°Kr, 7!°Bi and 7!°Po backgrounds. The 7Be neut- 
rino rate is constrained at the measured value!” within the error, and 
pep and CNO neutrino contributions are fixed at the levels of the SSM”, 
taking into account the values of the neutrino oscillation parameters”. 
The '*C and the synthetic pile-up rates are determined from the data 
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independently and fixed in the fit, allowing for a variation consistent 
with their measured uncertainty. The *'*Pb rate is fixed by the mea- 
sured rate of fast, time-correlated 7‘“Bi(B)—7!*Po(a) coincidences. The 
scintillator light yield and two energy resolution parameters are left free 
in the fit. 

The energy spectrum with the best-fit components is shown in Fig. 3. 
The corresponding values of the fitted parameters are given in Table 1. 

Many fits have been performed with slightly different conditions to 
estimate the robustness of the analysis procedure. In particular, we varied 
the energy estimator, the fit energy range, the data selection criteria and 
the pile-up evaluation method (Methods). The root mean square of the 
distribution of all the fits is our best estimate of the systematic error (7%). 
In addition, a systematic uncertainty (2%) due to the nominal fiducial 
mass determination is added in quadrature; this was obtained from cal- 
ibration data by comparing the reconstructed and nominal positions of 
a (??Rn-'4C) radioactive source located near the border of the fiducial 
volume”’. Other possible sources of systematic errors, like the depend- 
ence of the result on the details of the energy scale definition and on the 
uncertainties in the C and 7!°Bi B-decay shape factors, were investi- 
gated and found to be negligible (Methods). We also verified that vary- 
ing the pep and CNO neutrino rates within the measured or theoretical 
uncertainties changed the pp neutrino rate by less than 1%. We finally 
confirmed that the fit performed without constraining the '“C rate returns 
a “C value consistent with the one previously measured independently 
(see above) and does not affect the pp neutrino result. The systematic 
errors are given in Table 1 for all fitted species. 


y2/d.0.f. = 172.3/147 
— ppv: 144 + 13 (free) — 710B9: 583 + 2 (free) 
— ‘Bev: 46.2 + 2.1 (constrained) —— 14C: 39.8 + 0.9 (constrained) 
— pep v: 2.8 (fixed) —— Pile-up: 321 + 7 (constrained) 
— CNOr: 5.36 (fixed) — *10Bj: 27 + 8 (free) 
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Figure 3 | Fit of the energy spectrum between 165 and 590 keV. a, The best- 
fit pp neutrino component is shown in red, the '“C background in dark 
purple and the synthetic pile-up in light purple. The large green peak is *!°Po 
ot-decays. “Be (dark blue), pep and CNO (light blue) solar neutrinos, and 7!°Bi 
(orange) are almost flat in this energy region. The values of the parameters 


(in c.p.d. per 100 t) are in the inset above the figure. b, Residuals. Error bars, 1o. 
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Table 1 | Results from the fit to the energy spectrum 


Parameter 


Rate + statistical error 
(c.p.d. per 100t) 


Systematic error 
(c.p.d. per 100t) 


pp neutrino 144+ 13 +10 
B5Kr 1+9 +3 
210; 27+8 +3 
210Bg 583 +2 +12 


The best-fit value and statistical uncertainty for each component are listed together with its systematic 
error. The 7? per degree of freedom of the fit is 77/d.o.f. = 172.3/147. 


We note that the very low 85Kr rate (Table 1) is consistent with the 
independent limit (<7 c.p.d. per 100 t, 95% confidence level) obtained 
by searching for the B-y delayed coincidence *°Kr—> ®°""Rb—> *°Rb 
(lifetime of the intermediate metastable isotope, t = 1.46 ts; branch- 
ing ratio, 0.43%). 

We have checked for possible residual backgrounds generated by 
nuclear spallation processes produced by cosmic ray muons that inter- 
act in the detector. We detect these muons with >99.9% efficiency”®. 
We increased the time window for the muon veto from 300 ms to 5 s and 
observed no difference in the results. Furthermore, we searched for other 
possible background due to radioisotopes with sizeable natural abun- 
dances and sufficiently long half-lives to survive inside the detector over 
the timescale of this measurement. These include low-energy «-emitters 
such as ***Rn and *'*Po (both belonging to the radon decay chain), '*’Sm 
and '“°sm, and B-emitters (Be), which are all estimated to be negligible 
and are excluded from the final fit. One B-emitter, 87Rb (half-life, t)/. = 
4.7 X 10'° yr; 28% isotopic abundance; Q = 283.3 keV), is of particular 
concern because of the relatively high abundance of Rb in the Earth’s 
crust. Rubidium is an alkali chemically close to potassium but typically 
2,000-4,000 times less abundant in the crust. Under these assumptions, 
and using the measured {°K (t= 0.125 X 101° yr; 0.0117% isotopic 
abundance) activity in the fiducial volume, that is, <0.4 c.p.d. per 100 t at 
the 95% confidence level'®, the *’Rb activity in the Borexino scintillator 
can be constrained to be much less than 0.1 c.p.d. per 100 t, which is neg- 
ligible for this analysis. A deviation from the crustal isotopic ratio by a factor 
of 100 would still keep this background at ~1 c.p.d. per 100t. 

The solar pp neutrino interaction rate measured by Borexino is 144 + 
13 (stat.) + 10 (syst.) c.p.d. per 100t. The stability and robustness of the 
measured pp neutrino interaction rate was verified by performing fits 
with a wide range of different initial conditions. The absence of pp solar 
neutrinos is excluded with a statistical significance of 10¢ (Methods). 
Once statistical and systematic errors are added in quadrature and the 
latest values of the neutrino oscillation parameters” are taken into ac- 
count, the measured solar pp neutrino flux is (6.6 + 0.7) X 10’°cm™ 7s. 
This value is in good agreement with the SSM prediction’ (5.98 & (1 + 
0.006) X 10'°cm~*s_*). It is also consistent with the flux calculated 
by performing a global analysis of all existing solar neutrino data, in- 
cluding the °B, “Be and pep fluxes and solar neutrino capture rates*”. 
Finally, the probability that pp neutrinos produced in the core of the 
Sun are not transformed into muon or tau neutrinos by the neutrino 
oscillation mechanism is found to be P(v,—> v,) = 0.64 + 0.12, provid- 
ing a constraint on the Mikheyev-Smirnov-Wolfenstein large-mixing- 
angle (MSW-LMA) solution’***** in the low-energy vacuum regime 
(Methods). 


Outlook 


The proton-proton fusion reaction in the core of the Sun is the keystone 
process for energy production in the Sun and in Sun-like stars. The ob- 
servation of the low-energy (0-420 keV) pp neutrinos produced in this 
reaction was possible because of the unprecedentedly low level of radio- 
activity reached inside the Borexino detector. The measured value is in 
very good agreement with the predictions of both the high-metallicity 
and the low-metallicity SSMs. Although the experimental uncertainty 
does not yet allow the details of these models to be distinguished, this 
measurement strongly confirms our understanding of the Sun. Future 
Borexino-inspired experiments might be able to measure solar pp neut- 
rinos with the level of precision (~1%) needed to cross-compare photon 
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and neutrino solar luminosities, while providing insight into solar dynamics 
over 10°-yr timescales. At the same time, such a precise measurement 
of pp neutrinos would yield the ultimate test for the MSW-LMA neut- 
rino oscillation model and allow precision tests for exotic neutrino 
properties”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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The Borexino detector. The detector (Extended Data Fig. 1) is located deep under- 
ground (3,800 m of water equivalent) at the Gran Sasso laboratory, in central Italy”. 
The active neutrino target, 278 t of ultrapure liquid scintillator (pseudocumene 
(1,2,4-trimethylbenzene) solvent with 1.5 gl” : 2,5-diphenyloxazole (PPO) wave- 
length-shifting fluor), is contained inside a thin transparent nylon spherical vessel 
of 8.5 m diameter. Borexino detects solar neutrinos by measuring the energy depos- 
ited by recoiling electrons following neutrino-electron elastic scattering. The scin- 
tillator promptly converts the kinetic energy of electrons into photons, detected and 
converted into electronic signals (photoelectrons) by 2,212 PMTs mounted on a 
concentric 13.7 m-diameter stainless steel sphere (SSS). The interaction in the scin- 
tillator of ionizing particles, such as those resulting from radioactive decays inside the 
detector (that we call background), can mimic and cover the expected signal. Every 
effort was made to minimize radioactive contamination of the scintillator and all 
surrounding detector materials. (1) The scintillator was produced from crude oil with 
minimal amounts of '*C isotopic contamination (a key feature for this particular 
analysis). (2) Borexino has developed specific purification techniques**”’ for scin- 
tillator, water and all other detector components, implemented innovative cleaning 
procedures and operated a 4 t prototype, the Counting Test Facility”’; all this made 
it possible to reduce the radioactive contamination in ***U and **’Th to less than 
10° '’g per gram of liquid scintillator, a level never reached before”. The Count- 
ing Test Facility also proved that organic liquid scintillator produced from ancient 
and deep underground oil sources can yield a very low *C/'C ratio®®. (3) The vol- 
ume between the nylon vessel and the SSS is filled with 889 t of ultrapure non- 
scintillating fluid, and acts as radiation shield for radioactivity emitted by the PMTs. 
A second, larger (11.5 m diameter) nylon sphere prevents radon and other radio- 
active contaminants from the PMTs and SSS from diffusing into the inner part of 
the detector. The sphere is immersed in a 2,100 t water Cherenkov detector for re- 
sidual cosmic muons, which can induce background via spallation processes with 
the scintillator. 

Solar electron neutrino survival probability. After the SNO experiment in 2001'%, 
the deficit of solar v, observed on Earth has been explained by the neutrino oscillation 
mechanism: these v, undergo lepton flavour transformation into v, or v, in a 
quantum mechanical process requiring neutrinos to have a finite mass difference. 
The exact phenomenon is somewhat more complex (see ref. 35 for a recent review): 
beyond the neutrino oscillation mechanism in vacuum itself, there is a specific fea- 
ture coming from matter effects while the neutrinos travel inside the Sun (the MSW 
effect***). This effect is energy dependent and is fully effective for solar neutrinos 
above ~5 MeV. Below 1 MeV, vacuum oscillations dominate, and a smooth trans- 
ition region is predicted between ~1 and ~5 MeV. One of the parameters of oscil- 
lation is one of the three mixing angles between the neutrino species, for which all 
experimental data (from solar neutrino or reactor antineutrino experiments) iso- 
late one particular solution, called the large-mixing-angle (LMA) solution; the 
model describing the solar neutrino results is now called MSW-LMA. The Borexino 
data, which cover the full solar neutrino spectrum, make it possible to test the pre- 
dictions of the MSW-LMA model in all oscillation regimes with the same detector. 
Extended Data Fig. 2 shows the probability P(v.— v.) that v. produced in the core 
of the Sun are not transformed into v,, or v, via the neutrino oscillation mechanism. 
The violet band corresponds to the 1a prediction of the MSW-LMA solution, using 
the most recent values for the oscillation parameters”’. The survival probability for 
the pp neutrino data from the present paper is P(v, > v,) = 0.64 + 0.12, calculated 
using the formula 


ROP — PMG, 


DM n.(c, = On) 


P(Ve> Ve) 


where R“? = (167 + 19) X 10° c.p.s. per 100 is the experimental rate measured 
by Borexino, &°SM = 5.98 x (1 + 0.006) X 10'!°cm™?s~! is the pp neutrino 
theoretical flux, n, = (3.307 = 0.003) X 10°? is the number of electrons for 100t 
of the Borexino scintillator, and ¢, = 11.38 X 10 “cm? and Oy = 3.22 X 10° *° cm? 
are respectively the v, and v,, integrated cross-sections over the pp neutrino spec- 
trum (which include radiative corrections*”® (also P. Langacker and J. Erler, 
personal communication)). 

All the other data points correspond to the previous Borexino results'”"'°. We 
observe excellent agreement between our results and the MSW-LMA theoretical 
curve. 

The standard solar model and the pp neutrino flux. The predictions for the 
solar neutrino fluxes come from solar models developed by astrophysicists since 
the 1960s. The SSM***' uses the simplest physical hypotheses and the best avail- 
able physics input. It is assumed that energy is generated by nuclear reactions in 
the core of the star (see Extended Data Fig. 3 for the pp cycle) and is transported by 
radiation in the central part and by convection in the outer part. The basic evolution 
equation is the hydrostatic equilibrium between the outward radiative pressure 
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force and the inward gravitational force. One ingredient of the model, the metalli- 
city Z/X (content of heavy elements relative to hydrogen), is under discussion at 
present (see, for example, ref. 9): the most recent determination of Z/X (AGS09) 
gives a lower value than the previous one (GS98), but the corresponding predictions 
significantly disagree with the helioseismology observations. The measurement of 
all solar neutrino components, which differ between solar models in these two meta- 
llicity scenarios, could help resolve the issue. The most recent calculations are 
those of ref. 9 (other models, such as the one described in ref. 24, give similar re- 
sults). The predictions for the pp neutrino flux on Earth are’ 5.98 X (1 + 0.006) X 
10'°cm *s | forthe high-metallicity model and 6.03 X (1 + 0.006) X 10'°cm*s5 ! 
for the low-metallicity model (there is a difference of less than 1%, and we use the 
flux for high metallicity throughout this paper). When combined with the latest 
values of the neutrino oscillation parameters”, the SSM predicts 131 + 2 c.p.d. per 
100 t from pp neutrinos in Borexino. Neutrino oscillation parameters are key for 
translating the interaction rate measured in Borexino into a solar neutrino flux, as 
they provide the relative ratio between electron neutrinos and muon and tau 
neutrinos, which have different elastic scattering cross-section with electrons. 
Event selection, position and energy reconstruction. The basic event selection 
cuts for solar neutrino analysis are (1) no coincidence with muon events”? (a 300 ms 
veto is applied following muons crossing the scintillator and buffer volumes, and a 
2 ms veto following muons crossing only the water tank) and (2) position recon- 
struction within the innermost volume of the detector (the fiducial volume). The 
second cut is necessary to eliminate background from radioactivity in the nylon 
vessels, the SSS and the PMTs; for this analysis, R < 3.021 m, |z| < 1.67 m (the geo- 
metrical centre of the sphere defines the origin of the coordinates x and y (in the 
horizontal plane) and z, as well as the origin from where, R, is measured). 

A signal is recorded when more than a preset number of PMTs (25 until 15 
February 2013, 20 thereafter) detect light within a 100 ns time window. Dedicated 
software then decides whether to classify this as a physics event or archive it as elec- 
tronic noise or other instrumental effect. A ray-tracing algorithm triangulates 
photon arrival times at each struck PMT to determine the position of the event, 
which is assumed to be point-like. 

The procedure to assign the correct energy to each event starts from the number 
of photoelectrons recorded by the PMTs. For B- and y-events, the energy of an 
event is roughly proportional to the number of collected photoelectrons”. Notably, 
a-events display sizeable quenching of scintillation light, description of which is 
beyond the scope of this Article. The number of photons detected for a given type 
of event depends on its position, being maximal at the centre of the detector. We 
correct for this position dependence using a parameterized response obtained by 
calibrating the detector with known radioactive sources deployed at different 
positions. Source calibration data (mostly y-ray emitters) are then used to tune 
Monte Carlo simulation software’, allowing us to model the detector response 
correctly. These are in turn used to determine the energy response of the detector. 
We detect approximately 500 photoelectrons for a 1 MeV electron; this number 
progressively reduces as ageing, malfunctioning PMTs are taken offline. 

A complete description of the data selection and analysis can be found in ref. 28, 
with some conditions relaxed specifically for this analysis. In particular, the energy 
threshold for acceptance of events used in this analysis was lowered with respect to 
previous analyses looking for higher-energy neutrino interactions. Also by con- 
trast with what was done in previous measurements, no [-like selection condition 
(based on the different scintillation time profile of ~- and B-particles) was used, 
as its relative efficiency for neutrino and background events could not be reliably 
evaluated at the lower energies characteristic of solar pp neutrino interactions. 
Fit of the *C. The '*C rate has been determined by looking at a sample of data in 
which the event causing the trigger is followed by a second event within the time 
acquisition window of 16 1s. This particular second-event selection bypasses thresh- 
old effects intrinsic to any self-triggering approach, clearly visible in Extended Data 
Fig. 4. The energy spectrum and its fit, using the theoretical '*C B-emission shape”, 
are shown in Extended Data Fig. 5. The ‘*C spectrum fits the data well, yielding a 
™4C rate of 40 + 1 Bq per 100t. The uncertainty accounts for systematic effects due 
to the stability of the detector response in time, knowledge of the '*C spectral shape 
and fit conditions, including the fit energy range. We note that the relatively poorly 
known shape factor of the '*C B-emission spectrum has a <2% effect on the pp 
neutrino result. The measured “C rate translates into a '*C/'?C isotopic ratio of 
(2.7+0.1) X 10 |*®. The typical value for this ratio for atmospheric (that is, biolo- 
gic) carbon is ~10-'*. '*C is mainly produced by the nuclear reaction *N(n,p)'*C 
supported by cosmic rays in the upper atmosphere. The underground petroleum 
origin of the Borexino liquid scintillator explains its much smaller '*C abundance. 
Study of pile-up events. The pile-up component is determined using a data-driven 
method as follows. The real triggered events without any selection cuts are arti- 
ficially overlapped with random data samples. The combined synthetic events are 
selected and reconstructed using the procedure applied to the regular data. By 
construction, the synthetic pile-up method accounts for all possible event pile-up 
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combinations. The corresponding spectrum includes events that vary in energy with 
respect to the original event (that is, before being artificially overlapped with the 
random sample) by more than a given number of photoelectrons (Nmin). In addition, 
the same event selection criteria as in the real data set are applied after the overlap. 
Thus, some systematic effects, for example the position reconstruction of pile-up 
events, are automatically taken into account. To increase the statistical precision, 


every real event is overlapped four times, each time with a different random data 
sample, and the final pile-up spectrum is divided by a factor of 4. An expanded view 
of the synthetic pile-up spectral probability density function (visible in Figs 2 and 3) 
is presented in Extended Data Fig. 6. 

The final fit was performed using this synthetic pile-up with Nin = 5 photo- 
electrons. This component is dominated but not exclusively composed of '*C 
pile-up with itself. Some of the events are from dark noise of the PMTs. The ro- 
bustness of the method was confirmed by checking that the fit results were not 
dependent on the choice of Nmin- 

We compared the synthetic pile-up method with an alternative one. Regularly 
solicited trigger events with a 16 j1s acquisition time window (acquired at 0.5 Hz) 
are collected and sliced into fixed-time windows of the same duration as the signal 
window (230 and 400ns). The hits in these windows produce the energy distri- 
bution shown in Extended Data Fig. 7, which represents a randomly sampled signal 
from the detector, including contributions from dark noise of the PMTs, MC and 
other radioactive contaminants. Pile-up can be thought of as the combination of 
events belonging to any spectral component combined with such a spectrum. The 
final fit can then be performed without a separate pile-up probability density func- 
tion (as in the synthetic approach), by using ‘smeared’ spectral species in the final fit. 
The smearing is the convolution of ideal spectral components with the solicited 
trigger spectrum. Solar pp neutrino interaction rates measured using this method 
are in full agreement with those obtained with the synthetic pile-up method. 


Stability of the result. The stability and robustness of the measured pp neutrino 
interaction rate was verified by performing fits with a wide range of different initial 
conditions, including fit energy range, synthetic-versus-convolution pile-up spec- 
tral shape, and energy estimator. The distribution of pp neutrino interaction rates 
obtained for all these fit conditions is summarized in Extended Data Fig. 8. The 
possibility of some remaining external background in the fiducial volume has been 
carefully studied in all Borexino solar neutrino analyses (see, for example, refs 17, 
28). From these, we are confident that such a background at energies relevant for 
the pp neutrino study is negligible. In the particular case of the very low-energy part 
of the spectrum, we have tested this confidence by repeating the fit in five smaller 
fiducial volumes (with smaller radial and/or z-cut), which yields very similar 
results. Finally, the goodness (y’) of the spectral fit was computed using different 
values of the pp interaction rate, as shown in Extended Data Fig. 9. The absence of 
pp solar neutrinos is excluded with a statistical significance of 100. 
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Extended Data Figure 1 | The Borexino detector. The characteristic onion- 
like structure of the detector” is displayed, with fluid volumes of increasing 
radiological purity towards the centre of the detector. Although solar 
neutrino measurements are made using events whose positions fall inside the 
innermost volume of scintillator (the fiducial volume, shown as spherical for 
illustrative purposes only), the large mass surrounding it is necessary to 
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Fiducial volume 


shield against environmental radioactivity. The water tank (17 m high) 
contains about 2,100 t of ultraclean water. The diameter of the stainless steel 
sphere is 13.7 m, and that of the thin nylon inner vessel containing the 
scintillator is 8.5 m. The buffer and target scintillator masses are 889 and 278 t, 
respectively. 
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Extended Data Figure 2 | Survival probability of electron-neutrinos 
produced by the different nuclear reactions in the Sun. All the numbers are 
from Borexino (this paper for pp, ref. 17 for “Be, ref. 18 for pep and ref. 19 
for 8B with two different thresholds at 3 and 5 MeV). Be and pep neutrinos are 
mono-energetic. pp and °B are emitted with a continuum of energy, and the 
reported P(v,— v,) value refers to the energy range contributing to the 


measurement. The violet band corresponds to the +1o prediction of 

the MSW-LMA solution’. It is calculated for the °B solar neutrinos, 
considering their production region in the Sun which represents the 
other components well. The vertical error bars of each data point 
represent the +1o interval; the horizontal uncertainty shows the neutrino 
energy range used in the measurement. 
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Extended Data Figure 3 | The sequence of nuclear fusion reactions defining the pp chain in the Sun. The pp neutrinos start the sequence 99.76% of the time. 
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Extended Data Figure 4 | Study of the low energy part of the spectrum. The threshold effect for self-triggered events (black) is clear. The residual 
Comparison of the spectrum obtained with the main trigger (black) and by threshold effect at lower energy in the red curve is due to the finite efficiency for 


selecting events falling in the late part of the acquisition window triggered by _ identifying very low-energy events within a triggered data window. 
preceding events (red). Above 45 struck PMTs, the spectral shapes coincide. 
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Extended Data Figure 5 | uC spectrum, and residuals, obtained fromevents —_ with the zC B-emission spectrum (in units of standard deviations). The error 
triggered by a preceding event. a, Spectrum. b, Relative residuals of a fit bars thus represent + 1c intervals. 
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standard cuts. The small bump around 150 struck PMTs (~400 keV in dominated by '“C+"4C, and by “C+ dark noise. 
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Extended Data Figure 7 | Energy distribution of events collected with no randomly sampled. In an alternative treatment of pile-up, this spectrum is used 
threshold applied. The events correspond to regular, solicited triggers (sliced to smear each spectral component used in the fit (see text for details). 
into 230 ns windows). This represents what the detector measures when 
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Extended Data Figure 8 | Distribution of best-fit values for the pp neutrino _ energy estimator. The distribution shown is peaked around our reported value 
interaction rate. Values are obtained by varying the fit conditions, including of 144 c.p.d. per 100t. 
the fit energy range, synthetic-versus-analytic pile-up spectral shape, and 
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Extended Data Figure 9 | Goodness of fit versus pp neutrino interaction rate. The y’ minimum is at our reported value of 144c.p.d. per 100t. 
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AhR sensing of bacterial pigments 
regulates antibacterial defence 
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The aryl hydrocarbon receptor (AhR) isa highly conserved ligand-dependent transcription factor that senses environmental 
toxins and endogenous ligands, thereby inducing detoxifying enzymes and modulating immune cell differentiation and 
responses. We hypothesized that AhR evolved to sense not only environmental pollutants but also microbial insults. We 
characterized bacterial pigmented virulence factors, namely the phenazines from Pseudomonas aeruginosa and the 
naphthoquinone phthiocol from Mycobacterium tuberculosis, as ligands of AhR. Upon ligand binding, AhR activation 
leads to virulence factor degradation and regulated cytokine and chemokine production. The relevance of AhR to host 
defence is underlined by heightened susceptibility of AhR-deficient mice to both P. aeruginosa and M. tuberculosis. Thus, 
we demonstrate that AhR senses distinct bacterial virulence factors and controls antibacterial responses, supporting a 
previously unidentified role for AhR as an intracellular pattern recognition receptor, and identify bacterial pigments as a 


new Class of pathogen-associated molecular patterns. 


The aryl hydrocarbon receptor (AhR) is a ligand-dependent transcrip- 
tion factor that senses environmental toxins, notably halogenated and 
non-halogenated polycyclic aromatic hydrocarbons, such as 2,3,7,8- 
tetrachlorodibenzo-p-dioxin (TCDD)’. Upon activation, AhR enters the 
nucleus and, together with aryl hydrocarbon receptor nuclear translocator 
(ARNT), binds dioxin-responsive elements. This leads to transcriptional 
activation of target genes including its own negative regulator, the AhR 
repressor (AHRR), and detoxifying monooxygenases, including CYP1A1 
and CYPIBI (refs 1, 2). These enzymes degrade some ligands to meta- 
bolites with decreased activity and increased water solubility, thus facil- 
itating their removal’. Recent interest has shifted to endogenous AhR 
ligands, including tryptophan (Trp) degradation products, such as kynur- 
enines and 6-formylindolo[3,2-b] carbazole (FICZ)’. Although down- 
stream events of AhR activation by environmental toxins, including cell 
cycle control, apoptosis and haematopoiesis*’, have been intensively 
studied, the biological mechanisms regulated by natural AhR ligands 
are incompletely understood. Endogenous and dietary AhR ligands may 
play a role in immune control®’, for example, through transcriptional 
regulation of several pro-inflammatory cytokines*"®. A role of AhR in T 
cell polarization’**, accumulation of innate lymphoid cells and protection 
against Citrobacter rodentium infection has been revealed recently”'*"*. 
We demonstrate that AhR (1) binds and senses distinct pathogen- 
associated molecular patterns (PAMPs); (2) regulates immune and deg- 
radation pathways in myeloid and epithelial cells in response to these 
PAMPs, and (3) plays a crucial role in host defence against bacterial 
pathogens producing AhR ligands. Thus, by integrating pathogen recog- 
nition and regulating effector functions (that is, detoxification as well as 


induction of protective immunity), AhR plays a key role in innate 
defence against bacteria. 


AhR senses pigmented virulence factors 

We screened for AhR ligands of bacterial origin by molecular modelling. 
Pigmented virulence factors from pulmonary pathogens, that is, the phen- 
azines from P. aeruginosa such as pyocyanin (Pyo), 1-hydroxyphenazine 
(1-HP), phenazine-1-carboxylic acid (PCA) and phenazine-1-carboxamide 
(PCN), as well as the naphthoquinone phthiocol (Pht) from M. tuberculosis 
were predicted to bind AhR. Their structures show features similar to 
TCDD (Fig. 1a and Extended Data Fig. 1a)'’, and their low molecular 
weights allow them to diffuse and permeate cell membranes’*””. In vitro 
studies revealed toxic effects of reactive oxygen intermediates generated 
by Pht”, similar to P. aeruginosa phenazines”. Additionally, phenazines 
exert various effects on host cells (for example, direct damage, cytokine 
production, inhibition of ciliary motion and mucus production)?” 
and their abundance negatively correlates with lung function in cystic 
fibrosis patients”®. 

As the three-dimensional structure of AhR remains unknown, its 
interactions with these molecules were interrogated by in silico docking 
studies. The hydrophobic AhR binding pocket contains proton donors, 
whereas the pigments show two proton acceptors and fit well into the 
binding pocket (Fig. 1b, Extended Data Fig. 1b and Supplementary In- 
formation). In subsequent experiments we focused on Pyo, 1-HP and 
Pht. 

To verify our in silico predictions, we evaluated ligand binding to 
AhR by intrinsic fluorescence quenching or a standard competition assay 
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Neonatology, OE 6710, Hannover Medical School, Carl-Neuberg-Str. 1, 30625 Hannover, Germany. “Institute of Pharmacology and Structural Biology (IPBS), CNRS and University of Toulouse (Toulouse III), 
205 Route de Narbonne, 31077, Toulouse cedex 04, Toulouse, France. ®Microscopy Core Facility, Max Planck Institute for Infection Biology, Department of Immunology, Charitéplatz 1, 10117 Berlin, 
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Figure 1 | Bacterial pigmented virulence factors activate AhR. a, Structures 
of 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), pyocyanin (Pyo), 
1-hydroxyphenazine (1-HP) and phthiocol (Pht). b, In silico docking into the AhR 
ligand-binding pocket (yellow). Hydrophilic residues (magenta), aromatic and 
hydrophobic residues (green). ¢, Intrinsic tryptophan fluorescence quenching of 


using the radioactive ligand TCDD*. In the first case, a purified construct 
comprising only the major domains was used (amino acids 1 to 417, 
AhbRj_417), in the second the intact full-length receptor. Hyperbolic fitting 
of AhRj_417 fluorescence changes titrated with TCDD and 1-HP yielded 
dissociation constant (Kq) values of 11.1 and 23.5 nM, respectively (Fig. 1c 
and Extended Data Fig. 1c). Due to the fluorescence overlap of Pyo and 
Trp, binding of Pyo to AhR was not studied by ultraviolet but by concen- 
tration-dependent displacement of PHITCDD from wild-type (WT) 
mouse liver cytosol extracts (Fig. 1d)°. TCDD showed an inhibition con- 
stant K, = 40 nM (95% confidence limits: 15-107 nM), whereas the K;, of 
Pyo was 5.4 [1M (95% confidence limits: 2.0-15.1 1M). Binding of Pht to 
AhR could not be measured with this assay due to its sensitivity towards 
de-oxidation by dithiothreitol required for stable assay conditions. 

Ligand-induced AhR activation was verified by a luciferase reporter 
assay with human macrophages (THP-1), type II pneumocytes (A549) 
and bronchiolar Clara (H358) cell lines (Fig. le and Extended Data 
Fig. 1d, e). All pigments activated AhR in a dose-dependent manner 
(Extended Data Fig. 1f). Concentrations of up to 100 UM of Pyo have 
been detected in pulmonary secretions of cystic fibrosis patients infected 
with P. aeruginosa*®”’. Pht concentrations of 50 [1M have been estimated 
to be present in M. tuberculosis”. Thus bacterial ligands were used at 
concentrations of 50 11M in further experiments, which did not affect 
cell viability (Extended Data Fig. 1g). AhR was not activated upon sti- 
mulation of THP-1 AhR reporter cells with known Toll-like receptor 
(TLR) ligands (Extended Data Fig. 1h). Bacterial pigments significantly 
induced transcription of hallmark genes of AhR activation, including 
CYPIAI, CYP1B1 and AHRR (Extended Data Fig. 2a). Silencing of AhR 
(knockdown KD, AhR-KD cells; Extended Data Fig. 2b) decreased 
CYPIAI induction after ligand stimulation (Fig. 1f and Extended Data 
Fig. 2c). Consistently, microarray analysis of A549 cells stimulated with 
bacterial pigments revealed differential regulation of the AhR pathway 
(Extended Data Fig. 3a, b and Supplementary Tables 1 and 2). AhR can be 
activated indirectly via inhibition of CYP1A1 and increased abundance of 
Trp metabolite FICZ”*. However, AhR activation was still observed in 
absence of Trp in the medium (Extended Data Fig. 4a) and AHRR was 
induced upon stimulation in both Scramble and CYP1A1-KD THP-1 
cells (Extended Data Fig. 4b-d). Hence, AhR was activated by these pig- 
ments independently of CYP1A1 inhibition. In summary, AhR directly 
senses pigmented bacterial virulence factors thereby mobilizing the de- 
toxification pathway. 


AhR-P.aeruginosa crosstalk via phenazines 


Infection of the THP-1 AhR reporter cell line with the laboratory strain 
P. aeruginosa PAOI or stimulation with filtered supernatants from 
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purified AhR,.4)7. d, Displacement of [H]TCDD from AhR in WT mouse liver 
cytosol. e, AhR reporter cells stimulated for 24h. f, RT-PCR of CYPI1A1 at 4h 
(THP-1) and 24h (A549) after stimulation. c, d, Results of two experiments; 

e, f, cumulative data of at least two experiments, mean + s.e.m.; e, f, Student’s 
t-test. *P < 0.05; **P< 0.01; ***P < 0.001; ****P < 0.0001. 


PAOI cultures activated AhR (Fig. 2a, b). Mutants of the P. aeruginosa 
PA14 strain overexpressing Pyo and/or all phenazines were consistently 
stronger activators than the parental WT strain (PA14-WT1) (Fig. 2c, d 
and Extended Data Fig. 5a—c and Supplementary Table 3). Supernatants 
of the PA14 Aphz1/2 strain, which lack the production ofall phenazines”, 
induced significantly lower AhR activation compared to supernatants of 
the parental WT (PA14 WT2) (Fig. 2e, fand Extended Data 5a,d). These 
data define phenazines as major AhR activators in P. aeruginosa. 

Degradation of Pyo hasbeen described asa host defence stratagem against 
P. aeruginosaalthough the underlying mechanisms remain controversial”. 
Pyo degradation was observed in A549 cells and knockdown of AhR 
reduced Pyo degradation, as determined by spectrophotometry (Fig. 2g) 
and HPLC analysis (Fig. 2h and Extended Data Fig. 6a). Similar results 
were obtained for M. tuberculosis Pht (Extended Data Fig. 6b). Thus, sens- 
ing of bacterial pigments by AhR caused their direct degradation. AhR 
regulates the expression of several xenobiotic metabolizing enzymes 
(XME)**?. Microarray analysis of A549 cells stimulated with bacterial 
pigments identified Phase I XMEs (CYP1A1 and CYP1B1) amongst the 
strongest upregulated genes in Scramble as compared to AhR-KD cells 
(Extended Data Fig. 6c and Supplementary Tables 4 and 5). Although 
direct detoxification of environmental toxins is a well-described mech- 
anism, its relevance in immune defence against pathogens has been lar- 
gely ignored. 


AhR in P. aeruginosa infection 


We used Ahr mice to determine AhR contribution to host defence against 
pulmonary infection with the opportunistic pathogen P. aeruginosa’’****, 
Ahr“ mice developed severe disease and respiratory distress after infec- 
tion with PAO] (Fig. 3a and Extended Data Fig. 7a, b). Lung bacterial 
loads and tissue damage were consistently exacerbated in Ahr’ mice 
(Fig. 3b, c and Extended Data Fig. 7c) which succumbed to P. aeruginosa 
infection more rapidly than WT mice (Fig. 3d). Absence of AhR fostered 
alveolar disruption, vascular leakage and deposition of bacterial micro- 
colonies in the alveoli (Fig. 3e). Neutrophils are essential for early defence 
against P. aeruginosa™. Their numbers were significantly reduced in lung 
tissue and bronchoalveolar lavage fluid (BALF) of Ahr” mice promptly 
post-infection (p.i.) (Fig. 3f and Extended Data Fig. 8a, b), consistent with 
reduced cell infiltration (Fig. 3e). Abundance of myeloid cells in uninfec- 
ted Ahr’ versus WT mice and lymphoid cell frequencies in infected 
Arh” versus WT mice did not differ significantly (Extended Data 
Fig. 8c-e). Frequencies of necrotic and apoptotic leukocytes in BALF 
from WT and Ahr” mice were comparable, suggesting defective recruit- 
ment rather than increased cell death in infected Ahr” mice (Fig. 3g). 
Accordingly, concentrations of neutrophil chemoattractants*’, CXCL1 
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The canonical AhR pathway was activated in leukocytes (similar pro- 
portions of neutrophils and inflammatory macrophages (Extended Data 
Fig. 8b)), derived from BALF of P. aeruginosa-infected mice (Fig. 4c and 
Extended Data Fig. 8g, h). Microarray analysis unveiled AhR-dependent 
differentially regulated genes and functions (Supplementary Tables 6-9). 
AhR-regulated pro-inflammatory cytokine and chemokine genes were 


Figure 4 | Haematopoietic and non-haematopoietic cells contribute to 
AhR-mediated defence against P. aeruginosa. a, Bone marrow chimaeras and 
b, Lysm-Cre X Ahr, WT controls (Ahr) and Ahr’ mice were infected 
with 10’ c.f.u. of P. aeruginosa PAOL. c, d, RT-PCR analysis of leukocytes. 
e, (RT-PCR of Cxcll and Cxcl5 in pneumocytes stimulated for 4h. f, g, Periodic 
Acid Schiff (PAS) staining of lungs; PAS™ cells are indicated by asterisks. Scale 
bars, 100 UM (left), 30 uM (right). a, One experiment; b, cumulative data of 
two experiments; c, d, representative of two experiments, mean + s.d.; 

e, cumulative data of three experiments, mean + s.e.m.; g, cumulative data of 
two experiments (naive WT n = 8, Ahr“ n = 5; PAOI WT n = 12, Ahr 
n= 12), mean + s.e.m, a, b, Log-rank test; c-e, Student’s t-test; g, two-way 
ANOVA. a, WT> Ahr” vs WT->WT P< 0.05; Ahr” WT vs WT>WT 
P<0.01; Ahr” WT vs WT Ahr“ NS; b, St Cre X Ahr ys Ahr 
P<0.01; Lysm-Cre X Ahr®™ vs Ahr” NS; Ahr®™ vs Ahr” P<0.01. 

*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001. ND, not detected; NS, 
not significant. 


confirmed by qRT-PCR (Fig. 4d). A crosstalk between AhR and other 
signalling pathways has been described**””, probably accounting for a 
more complex regulation of these genes. 

Respiratory epithelial cells are well positioned for prompt initiation 
of antibacterial defence through cytokines and chemokines“. Induction 
of neutrophil chemoattractants in mouse type II pneumocytes (T7 cell 
line) by Pyo/1-HP depended on AhR expression (Fig. 4e and Extended 
Data Fig. 2b). Epithelial cells also serve as primary barrier against patho- 
gen invasion by producing mucus and surfactants”. A role for AhR in 
mucus production in response to TCDD has been addressed"". P. aeru- 
ginosa induced mucus production in bronchiolar and bronchial cells of 
WT mice, whereas mucus-positive cells were hardly detected in Ahr” 
mice (Fig. 4f, g). We conclude that AhR sensing of bacterial pigments 
promptly triggers natural and innate defence mechanisms in myeloid 
and parenchymal cells, as exemplified by induction of inflammatory 
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Figure 6 | AhR mediates macrophage activation and mycobacterial growth 
inhibition. a, AhR reporter-infected THP-1 cells. b-d, Bone-marrow-derived 
macrophages (BMDMs) were infected with M. tuberculosis (multiplicity of 
infection (m.o.i.) = 5). b, Uptake of H37Rv-GFP at 4h p.i. c, Bacterial growth. 
d, Cytokines in supernatants at 24h p.i. a~-d Cumulative data of at least 

two experiments, mean + s.e.m.; a, one-way ANOVA; b, Mann-Whitney 
U-test; c, d, two-way ANOVA. *P < 0.05; **P < 0.01; ***P < 0.001; 

*#**D < 0.0001. 


cytokines, neutrophil recruitment to, and mucus production at, the site 
of infection. 


AbR in M. tuberculosis infection 


To corroborate the generality of AhR sensing in pulmonary infections, 
we determined its contribution to host defence against the persistent 
intracellular pathogen M. tuberculosis in vivo. Ahr’~ mice succumbed 
to low-dose aerosol infection, but only at late time points (= 100 days) 
(Fig. 5a). Yet, by 7 days post-infection, bacterial load was significantly 
elevated in lung, liver and spleen of Ahr’ mice (Fig. 5b, c), pointing to 
a critical role of AhR in control of innate pulmonary defence and in 
preventing dissemination of M. tuberculosis. No significant differences 
in tissue damage were observed (Extended Data Fig. 9a, b), although a 
trend towards elevated numbers of inflammatory macrophages was 
observed in lungs of Ahr” versus WT mice at day 7 post-infection 
(Fig. 5d). Abundance of TNF-o was decreased (Fig. 5e), and macrophage 
chemoattractants CCL2, CCL3 and CCL5 were increased in Ahr“ lung 
homogenates (Fig. 5e), correlating with the kinetics of the mononuc- 
lear phagocytes. Lymphocyte numbers (Extended Data Fig. 9c), and 
concentrations of most other cytokines and chemokines did not differ 
(Extended Data Fig. 9d). 

Coherent with a decisive role for AhR in early tuberculosis, after 
high-dose aerosol M. tuberculosis infection, mortality of Ahr” versus 
WT mice was exacerbated (Fig. 5a). In infected bone marrow chimaeras, 
resistance was increased in all experimental groups over Ahr’ —> Ahr” 
mice (Fig. 5f) and no statistically significant differences were observed 
between WI->WT versus WT-> Ahr” (Fig. 5f). Ameliorated survival 
of Ahr’—> WT over Ahr’ —> Ahr“ mice point to an additional role 
of AhR sensing by non-haematopoietic cells. Consistent with a central 
role of AhR in the hematopoetic compartment, survival did not differ 
significantly between Lysm-Cre X Ahr versus Ahr’ mice (Fig. 5g). 
The critical role of myeloid cells in AnR-dependent host defence against 
tuberculosis is congruent with a critical function of macrophages in 
this infection”. 

AhR-dependent luciferase activity was induced in M. tuberculosis 
and M. bovis BCG-infected macrophage reporter cells (Fig. 6a). Fur- 
thermore, AhR-regulated genes were upregulated upon infection of pri- 
mary human macrophages with M. tuberculosis (Extended Data Fig. 9e). 
Internalization of M. tuberculosis by Ahr’ bone-marrow-derived 
macrophages (BMDMs) remained unaffected (Fig. 6b), but mycobac- 
terial growth inhibition and secretion of the pro-inflammatory cytokines 
TNF-o, IL-12p40, and IL-6 were impaired in Ahr” BMDMs (Fig. 6c, d). 
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Intriguingly, Pht was detected in lipid fractions of M. tuberculosis, 
which activated AhR, while non-activating lipid fractions were devoid 
of Pht (Extended Data Fig. 10). Although we are confident that Pht sen- 
sing by AhR is critical for prompt mobilization of host defence in 
tuberculosis, we cannot exclude the existence of other mycobacterial 
or endogenous ligands, notably at later stages of tuberculosis”. 


Discussion 


Our findings extend the list of AhR ligands beyond environmental tox- 
ins, endogenous molecules and dietary components to bacterial viru- 
lence factors. We demonstrate that: (1) AhR binds bacterial pigments 
comprising a redox-cycling phenazine/naphthoquinone moiety, namely, 
P. aeruginosa Pyo, 1-HP, and probably PCA and PCN, as well as M. 
tuberculosis Pht; (2) AhR sensing leads to transcription of canonical 
detoxifying genes as well as regulation of cytokine and chemokine pro- 
duction; (3) AhR activation regulates inflammatory leukocyte recruit- 
ment to the infected lung and control of bacterial replication; and (4) AhR 
plays a central role in defence against both acute and chronic bacterial 
infections. 

The family of phenazines represents an immense class of secondary 
metabolites present only in prokaryotes“ but not restricted to Pseudomonas 
spp.'****°, Naphthoquinones are broadly distributed in prokaryotes, 
plants and animals”. As a corollary, we speculate that the broad distri- 
bution of these pigments has served as a driving force for AhR evolution 
in host defence against exogenous insult. Concepts of how the innate 
immune system activates host defence against bacteria have focused on 
direct attack of invading pathogens, followed by instruction of acquired 
immunity**”. In contrast, maintenance of host integrity threatened by 
environmental toxins has emphasized rapid detoxification. Our data 
draw attention to the need for a conceptual convergence of both types 
of responses for pathogens producing virulence factors, which directly 
affect host cells. The AhR is well equipped for both functions by con- 
comitantly mobilizing virulence factor neutralization and bacterial erad- 
ication. Sensing of PAMPs by pattern recognition receptors (PRRs) 
expressed by immune cells, notably TLR cognates, alerts the host of in- 
vading pathogens**”’. Here, we propose bacterial pigments as a new class 
of PAMPs, and as a corollary, AhR as a new PRR. Defining AhR as a 
PRR, which senses PAMPs (phenazines/naphthoquinones as shown 
here) and self-components associated with inflammation—also termed 
danger-associated molecular patterns (DAMPs)—indicates biological 
similarities rather than disparities between the two types of surveillance 
mechanisms in control of bacterial infections. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript 
isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using 
poly(A)* RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under 
environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode 
thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing 
changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. 
Gonads express hundreds of previously unknown coding and long non-coding RNAs (IncRNAs), some of which are 
antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive 
intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more 
complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and 


polyadenylation sites. 


Next-generation RNA sequencing (RNA-seq) has permitted the map- 
ping of transcribed regions of the genomes of a variety of organisms’. 
These studies demonstrated that large fractions of metazoan genomes 
are transcribed, and they also catalogued individual elements of tran- 
scriptomes, including transcription start sites’, polyadenylation sites*”, 
exons and introns®. However, the complexity of the transcriptome arises 
from the combinatorial incorporation of these elements into mature 
transcript isoforms. Studies that inferred transcript isoforms from short- 
read sequence data focused on a small subset of isoforms, filtered using 
stringent criteria”*. Studies using complementary DNA (cDNA) or ex- 
pressed sequence tag (EST) data to infer transcript isoforms have not 
had sufficient sampling depth to explore the diversity of RNA products 
at most genomic loci’. Although the human genome has been the focus 
of intensive manual annotation”, analysis of strand-specific RNA-seq 
data from human cell lines reveals over 100,000 splice junctions not 
incorporated into transcript models"’. Thus, a large gap exists between 
genome annotations and the emerging transcriptomes observed in next- 
generation sequence data. In Drosophila, we previously described a non- 
strand-specific poly(A)" RNA-seq analysis of a developmental time 
course through the life cycle® and cap analysis of gene expression (CAGE) 
analysis of the embryo”, which discovered thousands of unannotated 
exons, introns and promoters, and expanded coverage of the gnome 
by identified transcribed regions, but not all elements were incorpo- 
rated into full-length transcript models. Here we describe an expansive 
poly(A)" transcript set modelled by integrative analysis of transcription 


start sites (CAGE and 5’ rapid amplification of cDNA ends (RACE)), 
splice sites and exons (RNA-seq), and polyadenylation sites (3’ expressed 
sequence tags (ESTs), cDNAs and RNA-seq). We analysed poly(A)* 
RNA data from a diverse set of developmental stages®, dissected organ 
systems and environmental perturbations; most of this data is new and 
strand-specific. Our data provide higher spatiotemporal resolution and 
allow for deeper exploration of the Drosophila transcriptome than was 
previously possible. Our analysis reveals a transcriptome of high com- 
plexity that is expressed in discrete, tissue- and condition-specific mes- 
senger RNA and IncRNA transcript isoforms that span most of the 
genome and provides valuable insights into metazoan biology. 


A dense landscape of discrete poly(A)* transcripts 


To broadly sample the transcriptome, we performed strand-specific, 
paired-end sequencing of poly(A)* RNA in biological duplicate from 
29 dissected tissue samples including the nervous, digestive, reproduct- 
ive, endocrine, epidermal and muscle organ systems of larvae, pupae 
and adults. To detect RNAs not observed under standard conditions, we 
sequenced poly(A) * RNA in biological duplicate from 21 whole-animal 
samples treated with environmental perturbations. Adults were chal- 
lenged with heat-shock, cold-shock, exposure to heavy metals (cadmium, 
copper and zinc), the drug caffeine or the herbicide paraquat. To deter- 
mine whether exposing larvae resulted in RNA expression from prev- 
iously unidentified genes, we treated them with heavy metals, caffeine, 
ethanol or rotenone. Finally, we sequenced poly(A)* RNA from 21 
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previously described”’ and three ovary-derived cell lines (Supplementary 
Methods). In total, we produced 12.4 billion strand-specific read pairs 
and over a terabase of sequence data, providing 44,000-fold coverage of 
the poly(A)* transcriptome. 

Reads were aligned to the Drosophila genome as described’, and full- 
length transcript models were assembled using our custom pipeline 
termed GRIT", which uses RNA-seq, poly(A) “seq, CAGE, RACE”, 
ESTs” and full-length cDNAs” to generate gene and transcript models 
(Supplementary Methods). We integrated these models with our own 
and community manual curation data sets to obtain an annotation (Sup- 
plementary Information, section 12) consisting of 304,788 transcripts 
and 17,564 genes (Fig. la and Supplementary Fig. 1), of which 14,692 
are protein-coding (Supplementary Data 1 and updates available at 
http://fruitfly.org). Ninety per cent of genes produce at most 10 tran- 
script and five protein isoforms, whereas 1% of genes have highly com- 
plex patterns of alternative splicing, promoter usage and polyadenylation, 
and may each be processed into hundreds of transcripts (Fig. 1a, b). 
Our gene models span 72% of the euchromatin, an increase from 65% 
in FlyBase 5.12 (FB5.12), the reference annotation at the beginning of 
the modENCODE project (Supplementary Table 1 compares annota- 
tions in 2008-13). There were 64 euchromatic gene-free regions longer 
than 50 kb in FB5.12, and 25 remaining in FB5.45. Our annotation 
includes new gene models in each of these regions. Newly identified 
genes (1,468 total) are expressed in spatially and temporally restricted 
patterns (Supplementary Fig. 2), and 536 reside in previously unchar- 
acterized gene-free regions. Others map to well-characterized regions, 
including the ovo locus, where we discovered a new ovary-specific, 
poly(A) * transcript (Mgn94020, Supplementary Data 1 and 2), extending 


from the second promoter of ovo on the opposite strand and spanning 
107 kb (Fig. 1c). Exons of 36 new genes overlap molecularly defined 
mutations with associated phenotypes (genome structure correction 
(GSC) P value ~0.0002), indicating potential functions (Supplemen- 
tary Table 2). For example, the lethal P-element insertions /(3)L3051 
and /(3)L4111 (ref. 17) map to promoters of Mgn095159 and Mgn95009, 
respectively, indicating these may be essential genes. Nearly 60% of the 
intergenic transcription we previously reported® is now incorporated 
into gene models. 


Transcript diversity 


Over half of spliced genes (7,412; 56%) encode two or more transcript 
isoforms with alternative first exons. Most of such genes produce alterna- 
tive first exons through coordinated alternative splicing and promoter 
usage (59%, 4,389 genes, hypergeometric P value < 1 X 10° '°); how- 
ever, a substantial number of genes use one, but not both mechanisms 
(Fig. 2a). Only 1,058 spliced genes have alternative first exons that alter 
protein-encoding capacity and increase the complexity of the predicted 
proteome. Some genes, such as G protein B-subunit 13F (GB 13F, Fig. 2b 
and Supplementary Fig. 3) have exceptionally complex 5' UTRs, but 
encode a single protein. 

We measured splicing efficiency using the ‘per cent spliced in’ (‘P) 
index—the fraction of isoforms that contain the particular exon®. Introns 
flanked by coding sequence are retained at an average ‘V = 0.7, whereas 
introns flanked by non-coding sequence are retained > fivefold more 
often, with an average ¥ = 3.8(P<1X 10 '° subsampling/two-sample 
t-test), and is most frequent in 5’ UTRs (mean P = 5.1, Fig. 2c). 
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Figure 1 | Overview of the annotation of the Drosophila melanogaster 
transcriptome. a, Scatterplot showing the per gene correlation between 
number of proteins and number of transcripts. The genes Dscam and para 
are omitted as extreme outliers both encoding >10,000 unique proteins. 
b, Dystrophin (Dys) produces 72 transcripts and encodes 32 proteins. 
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Highlighted is alternative splicing and polyadenylation at the 3’ end. CAGE 
(black), RNA-seq (tan, blue), splice junctions (shaded grey as a function of 
usage). ¢, An internal promoter of ovo is bidirectional in ovaries and produces a 
IncRNA (430 bp, red) bridging two gene deserts. CAGE (black), RNA-seq 
(pink), counts are read-depth (minus-strand given as negative). 
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Figure 2 | Splicing complexity across the gene body. a, Alternative first exons 
occur in two main configurations: multiple transcription start sites (TSS, pink) 
and multiple donor sites (DS, light blue). A subset of the genes in the multiple 
TSS category produce transcripts with different TSSs and shared donor sites 
(red), and a subset of the genes in the DS category produce transcripts with a 
shared TSS and different donor sites (blue). Some genes in the multiple TSS 
category directly affect the encoded protein (maroon), and similarly for DS 
(dark blue). The overlap of configurations is radially proportional (units 
indicate percentage of all spliced genes). b, Poly(A) * testes (blue) and central 
nervous system (CNS) (orange) stranded RNA-seq of Gf13F showing complex 
processing and splicing of the 5’ UTR. An expansion of the 5’ UTR showing 
some of the complexity. Transcription of the gene initiates from one of three 
different promoters (green arrows) terminates at one of ten possible poly(A)~ 


Despite the depth of our RNA-seq, these data show that 42% of genes 
encode only a single transcript isoform, and 55% encode a single protein 
isoform (Supplementary Methods). In mammals, it has been estimated 
that 95% of genes produce multiple transcript isoforms’*””, (estimates 
for protein-coding capacity have not been reported). 

The majority of transcriptome complexity is attributable to forty- 
seven genes that have the capacity to encode > 1,000 transcript isoforms 
each (Supplementary Table 3), and account for 50% of all transcripts 
(Fig. 3a). Furthermore, 27% of transcripts encoded by these genes were 
detected exclusively in samples enriched for neuronal tissue, and another 
56% only in the embryo (83% total). To determine their tissue specifi- 
cities we conducted embryonic in situ expression assays (Fig. 3b) and 
found that 18 of 35 are detected only in neural tissue (51% compared 
with 10% genome-wide, hypergeometric P value <1 107°, Sup- 
plementary Table 4). Of these genes, 48% have 3’ UTR extensions in 
embryonic neural tissue” (5% genome-wide, P < 1X 10 '°). Furthermore, 


addition sites (from adult head poly(A) *seq, red) and generates 235 transcripts. 
The first exon has two alternative splice acceptors that splice to one of eleven 
different donor sites. Only five donor sites are shown owing to the proximity of 
splice sites. Four splice donors are represented by the single red line differing 
by 12, 5 and 19 bp, respectively. Three splice donors are represented by the 
single green line differing by 12 and 11 bp. Two splice donors are represented by 
the single purple line differing by 7 bp. These splice variants are combined with 
four proximal internal splices (Supplementary Fig. 3a) to generate the full 
complement of transcripts. c, Intron retention rates (‘) across the gene body. 
The genome-wide mean lengths of exons and introns are connected by red 
parabolic arcs, which illustrate the upper and lower quartiles of intron retention 
(across all samples) for introns retained at or above 20 ¥ in at least one sample. 


44% are targets of RNA editing (4% genome-wide’, P<1 X10 *°, 
with 18 of 21 validated*'), and 21% have 3’ UTR extensions and 
RNA editing sites (10 of 65 genome-wide, P< 1X 10 '°°). The capa- 
city to encode thousands of transcripts is largely specific to the nervous 
system and coincides with other classes of rare, neural-specific RNA 
processing. 


Tissue- and sex-specific splicing 

To examine the dynamics of splicing, we calculated switch scores or 
A, for each splicing event by comparing the maximal and minimal 
YY values across all samples, and in subsets including just the devel- 
opmental and tissue samples. In contrast to the median ¥ values, the 
distribution of AY? values is strikingly different between the develop- 
mental and tissue samples. Among the developmental samples, 38% 
of events have a A‘¥ = 50%, whereas between the tissue samples 63% 
of events have a AY = 50%. This difference is even more pronounced 
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Figure 3 | Complex splicing patterns are mainly limited to neural tissues. 
a, A small minority of genes (47, 0.2%) encode most transcripts. b, In situ RNA 
staining of constitutive exons of four genes with highly complex splicing 
patterns in the embryo. Syncrip (Syp), CAP, retinal degeneration A (rdgA) and 


GluCla show specific late embryonic neural expression in the ventral midline 
neurons; dorsal/lateral and ventral sensory complexes; Bolwig’s organ or larval 
eye; and central nervous system, respectively. 


28 AUGUST 2014 | VOL 512 | NATURE | 395 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


at higher AV thresholds—only 6% of events have a A’ = 80% between 
the developmental samples, whereas 31% of events have a AY = 80% 
between the tissue samples. Thus, most splicing events are highly tissue- 
specific. Of the 17,447 alternative splicing events analysed (Supplemen- 
tary Information, section 19), we find that 56.6% changed significantly 
(AY > 20%, Bayes factor >20). Clustering revealed groups of splicing 
events that are co-ordinately regulated in a tissue-specific manner. For 
example, 1,147 splicing events are specifically included in heads and 
excluded in testes or ovaries, whereas 797 splicing events are excluded in 
heads but included in testes or ovaries (Fig. 4a). 

We identified hundreds of sex-specific splicing events from adult 
male and female RNA-seq data®. To further explore sex-specific splic- 
ing, we compared the splicing patterns in male and female heads enriched 
for brain tissues. There were striking differences in gene expression 
levels, however, only seven splicing events were consistently differenti- 
ally spliced at each time point after eclosion (average A'V > 20%), and 
these largely corresponded to genes in the known sex-determination 
pathway (Supplementary Information, section 19A). We find few exam- 
ples of head sex-specific splicing. This is in contrast to previous studies, 
which have come to conflicting conclusions and used either microar- 
rays analysing only a subset of splicing events or single read 36-bp RNA- 
seq’*”* with an order of magnitude fewer reads**. 

We identified 575 alternative splicing events that are differentially 
spliced in whole male and female animals (A‘¥ > 20%) and analysed 
the tissue-specific splicing patterns of each event (Fig. 4b). We found that 
186 of the 321 male-biased splicing events were most strongly included 
in testes or accessory glands, and 157 of 254 female-biased exons were 
ovary-enriched. Consistent with the extensive transcriptional differ- 
ences observed in testes compared to other tissues, the genes containing 
male-specific exons are enriched in functions related to transcription. 
In contrast, the female-specific exon containing genes are enriched in 
functions involved in signalling and splicing ((http://reactome.org)”, 
Supplementary Table 6). Together, these results indicate that the major- 
ity of sex-specific splicing is due to tissue-specific splicing in tissues 
present only in males or females. 
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Long non-coding RNAs 

A growing set of candidate long non-coding RNAs (IncRNAs) have 
been identified in Drosophila®?°”’. In FB5.45 there were 392 annotated 
IncRNAs, and it has been suggested that as many as 1,119 IncRNAs 
may be transcribed in the fly’*. However, this number was based on 
transcribed regions, not transcript models, and used non-stranded RNA- 
seq data**. We find 3,880 genes produce transcripts with ORFs encod- 
ing fewer than 100 amino acids. Of these, 795 encode conserved proteins 
(Methods) longer than 20 amino acids. For example, a single exon gene 
on the opposite strand and in the last intron of the early developmental 
growth factor spdtzle encodes a 42-amino-acid protein that is highly 
conserved across all sequenced Drosophila species. We identified 1,875 
candidate IncRNA genes producing 3,085 transcripts, 2,990 of which have 
no overlap with protein-coding genes on the same strand (Supplementary 
Data 2). Some of these putative IncRNAs may encode short polypep- 
tides, for example, the gene tarsal-less encodes three 11-amino-acid 
OREs with important developmental functions”. We determined protein 
conservation scores for each ORF between 20 and 100 amino acids 
(Supplementary Table 6). Of the 1,119 predicted IncRNAs”*, we pro- 
vide full-length transcript models for 246 transcribed loci; the remain- 
der were expressed at levels beneath thresholds used in this study. 
This is not surprising, the expression patterns of IncRNAs are more 
restricted than those of protein-coding genes: the average IncRNA is 
expressed (bases per kilobase per million mapped bases® (BPKM) > 1) 
in 1.5 developmental and 3.2 tissue samples, compared to 6.6 and 17 for 
protein-coding genes, respectively. Many IncRNAs (563 or 30%) have 
peak expression in testes, and 125 are detectable only in testes. Similarly 
restricted expression patterns have been reported for IncRNAs in humans 
and other mammals**”’. 

Interestingly, all newly annotated genes overlapping molecularly 
defined mutations with phenotypes are IncRNAs (Supplementary Table 2). 
For instance, the mutation D114.3 is a regulatory allele of spineless (ss) 
that maps 4 kb upstream of ss** and within the promoter of Mgn4221. 
Similarly, Mgn00541 corresponds to a described, but unannotated 2.0 kb 
transcript overlapping the regulatory mutant allele ci” of cubitus 


Figure 4 | Sex-specific splicing is mainly tissue- 
specific splicing. a, Clusters of tissue-specific 
splicing events. The scale bar indicates z-scores of 
W. Adult mated male (AdMM), adult mated female 
(AdMF) and adult virgin female (AdVF) heads are 
from 1-, 4- and 20-day-old animals, respectively. 
Testes are from 4-day-old adult males, and ovaries 
are from mated and virgin 4-day-old adult females. 
b, Sex-specific splicing events in whole animals are 
primarily testes- or ovary-specific splicing events. 
Adult male (AdM) and adult female (AdF) animals 
are 1, 5 and 30 days old. Accessory glands were 
dissected from 4-day-old adult males. The RNA- 
seq columns from heads, testes and ovaries are as 
described in a. 
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interruptus°’. It remains to be determined whether these mutations are 
a result of the loss-of-function of newly annotated transcripts or cis- 
acting regulatory elements (for example, enhancers) or both. 


Antisense transcription 


Drosophila antisense transcription has been reported”, but the cata- 
logue of antisense transcription has been largely limited to overlapping 
mRNAs transcribed on opposite strands. We identify non-coding anti- 
sense transcript models for 402 IncRNA loci that are antisense to mRNA 
transcripts of 422 protein-coding genes (for example, prd, Fig. 5a), and 
36 IncRNAs form ‘sense-antisense gene-chains’ overlapping more than 
one protein-coding locus, as observed in mammals****. In Drosophila, 
21% of IncRNAs are antisense to mRNAs, whereas in human 15% of 
annotated IncRNAs are antisense to mRNAs (GENCODE v.10). We 
assembled antisense transcript models for 5,057 genes (29%, compared 
to previous estimates of 15%**). For 67% of these loci, antisense expres- 
sion is observable in at least one cell line, indicating that sense/antisense 
transcripts may be present in the same cells. LncRNA-mediated anti- 
sense accounts for a small minority of antisense transcription: 94% of 
antisense loci correspond to overlapping protein-coding mRNAs tran- 
scribed on opposite strands, and of these, 323 loci (667 genes) share 
overlapping CDSs. The majority of antisense is due to overlapping UTRs: 
1,389 genes have overlapping 5’ UTRs (divergent transcription), 3,430 
have overlapping 3’ UTRs (convergent transcription), and 540 have 
both, meaning that, as with many IncRNAs, they form gene-chains across 
contiguously transcribed regions. A subset of antisense gene-pairs overlap 
almost completely (>90%), which we term reciprocal transcription. 
There are 13 such loci (Supplementary Fig. 5) and seven are male- 
specific (none are female-specific). 

The mRNA/IncRNA sense-antisense pairs tend to be more posi- 
tively correlated in their expression than mRNA/mRNA pairs, (mean 
r = 0.16 compared with 0.13, Kolmogorov-Smirnov (KS) two-sample 
one-sided test P< 10 °), and although this mean effect is subtle, the 
trend is clearly visible in the quantiles (95th percentile ncRNA/mRNA 
0.729 versus mMRNA/mRNA 0.634, Supplementary Fig. 6a). This effect 
is stronger when the analysis is restricted to cell line samples (Sup- 
plementary Fig. 6b). 

Even in homogenous cell cultures, evidence for sense-antisense tran- 
scription does not guarantee that both transcripts exist within individual 
cells: transcription could originate from exclusive events occurring in 
different cells. Cis-natural antisense transcripts (cis- NATs) are a sub- 
stantial source of endogenous siRNAs**, and their existence directly 
reflects the existence of precursor dsRNA. Cis-NAT-siRNA production 
typically involves convergent transcription units that overlap on their 
3’ ends, but other documented loci generate siRNAs across internal 
exons, introns or 5’ UTRs*”?. Analysis of head, ovary and testis RNAs 
showed that 328 unique sense/antisense gene pair regions generated 
21-nucleotide RNAs indicative of siRNA production (Supplementary 
Table 8), and these were significantly enriched (Supplementary Fig. 7a, 
Supplementary Methods) for pairs showing positively correlated expres- 
sion between sense and antisense levels across tissues (P = 2 X 10 °), 
embryo developmental stages (P = 4 x 10°), conditions (P= 9 X 10°“) 
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and across all samples (P = 3 X 10°). The tissue distribution of these 
cis-NAT-siRNAs showed a bias for testis expression (Supplementary 
Fig. 7b), with fourfold greater number relative to ovaries (P = 2 X 10— Nee 
binomial test) and sevenfold relative to heads (P= 4X 10 *°) and 
expression levels of siRNAs were substantially higher in testes than 
other tissues (Supplementary Fig. 7c). 

Over 80% of cis- NAT-siRNAs were derived from 3’-convergent gene 
pairs. Abundant siRNAs emanate from an overlap of the gryzun and 
CG14967 3’ UTRs (Supplementary Fig. 5). The remainder were dis- 
tributed amongst CDSs, introns and 5’ UTRs. We identified abundant 
testis-enriched siRNA production from a 5’-divergent overlap of Cyt- 
c-dand CG31808 (Fig. 5b) and from the entire CDS of dUTPase and its 
antisense non-coding transcript Mgn99994. 


Transcriptional effects of environmental stress 
Whole-animal perturbations each exhibited condition-specific effects, 
for example, the metallothionein genes were induced by heavy metals 
(Fig. 6a), but not by other treatments (Supplementary Table 9). The 
genome-wide transcriptional response to cadmium (Cd) exposure involves 
small changes in expression level in thousands of genes (48h after 
exposure), but only a small group of genes change > 20-fold, and this 
group includes six IncRNAs (the third most strongly induced gene is 
CR44138, Fig. 6a, Supplementary Fig. 8a). Four newly modelled IncRNAs 
are differentially expressed (1% false discovery rate (FDR)) in at least 
one treatment, and constitute newly described eco-responsive genes. 
Furthermore, 57 genes and 5,259 transcripts (of 811 genes) were detected 
exclusively in these treatment samples. Although no two perturbations 
revealed identical transcriptional landscapes, we find a homogeneous 
response to environmental stressors (Fig. 6b, Supplementary Fig. 8b). 
The direction of regulation for most genes is consistent across all treat- 
ments; very few are upregulated in one condition and downregulated 
in another. Classes of strongly upregulated genes included those anno- 
tated with the GO term “Response to Stimulus, GO:0050896” (most 
enriched, P value < 1 X 10 '°, Supplementary Fig. 8c), and those that 
encode lysozymes (> tenfold), cytochrome P450s, and mitochrondrial 
components mt:ATPase6, mt:Col, mt:Colll (> fivefold). Genes encod- 
ing egg-shell, yolk and seminal fluid proteins are strongly downregu- 
lated in response to every treatment except ‘cold2’ and ‘heat shock’ 
(Supplementary Fig. 8d). For these two stressors, samples were collected 
30 min after exposure, corresponding to an ‘early response test’ showing 
suppression of germ cell production is not immediate. 


Discussion 


Most transcriptional complexity in Drosophila occurs in tissues of the 
nervous system, and particularly in the functionally differentiating 
central and peripheral nervous systems. A subset of ultra-complex genes 
encodes more than half of detected transcript isoforms and these are 
dramatically enriched for RNA editing events and 3’ UTR extensions, 
both phenomena largely specific to the nervous system. Our study indi- 
cates that the total information output ofan animal transcriptome may 
be heavily weighted by the needs of the developing nervous system. 


a Chrat 12,084K 12,085K 12,086K 12,087K b 16.705K 16,710K 16,715K —— Chr2L 
"| 456 8 
Imaginal disc -_ = — es | 
+ g . : = : 21nt testes ; 
-— 1 
Tissue-specific 201 1,800 4,086 eto 52,202 
RNA-seq + Testes ee eee I a ; 
_| oo" — <aiieal er Testes 
~1,800 -17 “1 3 ee nO 
-397 -5 


Gene models Mgn00375 


SSS — = 


Figure 5 | Examples of antisense transcription. a, 5’/5’ overlapping 
bidirectional antisense transcription at the prd locus. Short RNA sequencing 
does not reveal substantial siRNA (that is, 21-nt-dominant small RNA) signal 
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in this region (data not shown). b, A 5’/5’ antisense region that produces 
substantial small RNA signal on both strands. nt, nucleotide. 
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Figure 6 | Effects of environmental perturbations on the Drosophila 
transcriptome. Adults were treated with caffeine (Cf), Cd, Cu, Zn, cold, heat or 
paraquat (PQ). a, A genome-wide map of genes that are up- or downregulated 
as a function of Cd treatment. Labelled genes are those that showed a 20-fold 
(<10% FDR) change in response (linear scale). Genes highlighted in red are 


The improved depth of sampling and spatiotemporal resolution 
resulted in the identification of more than 1,200 new genes not dis- 
covered in our previous study of Drosophila development’. A large 
fraction of the new genes are testes-specific, and many of these are 
antisense RNAs, as previously described in mammals*’. Some new 
IncRNAs, such as Mgn94020 (Fig. 1), form sense/antisense gene-chains 
that bring distant protein-coding genes into transcriptional relation- 
ships, another phenomenon previously described only in mammals“. 
Whenever Mgn94020 is detectably transcribed, the genes on the oppo- 
site strand in its introns are not, indicating that its transcription may 
serve a regulatory function independent of the RNA transcribed. The 
presence of short RNAs at many regions of antisense transcription 
indicates that sense and antisense transcripts are present in the same 
cells at the same times. Many of these Drosophila antisense transcripts 
correspond to ‘positionally equivalent” antisense transcripts in human. 
In the two species we found antisense IncRNAs opposite to ortholo- 
gous protein-coding genes. The apparent positional equivalence of fly 
and human antisense transcription at genes like Monocarboxylate 
transporter 1 (MCT1), even-skipped (EVX1), CTCF (CTCF), Adenosine 
receptor (ADORAZ2A), and many others’®** across 600 million years of 
evolution suggests a conserved regulatory mechanism basal to sexual 
reproduction in metazoans. 

Perturbation experiments identified new genes and transcripts, but 
perhaps more importantly, a general response to stress that is broader 
than the heat shock pathway. A similar study conducted on marsh 
fishes in the wake of the Deepwater Horizon incident in the Gulf of 
Mexico*’ demonstrated that the killifish response to chronic hydro- 
carbon exposure included induction of lyzosome genes, P450 cyto- 
chromes and mitochondrial components, and the downregulation of 
genes encoding eggshell and yolk proteins*’. This overlap of expres- 
sional responses by gene families across phyla suggests a conserved 
metazoan stress response involving enhanced metabolism and the 
suppression of genes involved in reproduction. 

We defined an extensive catalogue of putative IncRNAs. However, 
many genes are known to encode poorly conserved, short polypep- 
tides, including genes specific to the male gonad and accessory gland. 
Analysis of ribosome profiling initially indicated that a number of 
mammalian IncRNAs may be translated”, but this observation has 
been difficult to validate by proteomics*’, and further analysis has sug- 
gested that although IncRNAs have signatures of ribosome occupancy, 
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those identified in larvae*’. Some genes are omitted for readability, the complete 
figure and list of omitted genes are given in Supplementary Fig. 8a. b, Heat map 
showing the fold change of genes with a FDR < 10% (differential expression) in 
at least one sample (log, scale). PL, pre-lethal. 


they are not translated. Therefore, while we refer to these RNAs as 
‘non-coding’, additional data are needed to determine if they produce 
small polypeptides. 

The biological consequences of many of the phenomena reported 
here, including the observation that many genes encoding RNA bind- 
ing proteins exhibit extraordinary splicing complexity, often within 
their 5’ UTRs, require further study. The splicing factor pUf68 encodes 
more than 100 alternatively spliced 5’ UTR variants, but encodes a single 
protein. The idea that splicing factors may regulate one another to gen- 
erate complex patterns of splicing is consistent with recent computa- 
tional models**. More generally, the role of complex splicing in the 
adult and developing nervous system is unclear. To answer the ques- 
tions that come with increasingly complete transcriptomes in higher 
organisms, it will be necessary to study gene regulation downstream of 
transcription initiation, including the regulation of splicing, local- 
ization and translation. 


METHODS SUMMARY 

Animal staging, collection and RNA extraction. Tissues were dissected from 
Oregon R larval, pupal and adult staged animals synchronized with appropriate 
age indicators. Pupal and adult animals were treated with a number of environmental 
stresses. RNA was isolated using TRIzol (Invitrogen), treated with DNase and 
purified on a RNAeasy column (Qiagen). Poly(A)” RNA was prepared from an 
aliquot of each total RNA sample using an Oligotex kit (Qiagen). 

RNA-seq. Libraries were generated and sequenced on an Illumina Genome Analyzer 
IIx or HiSeq 2000 using paired-end chemistry and 76-bp or 100-bp cycles. The 454 
sequencing used poly(A)* RNA from Oregon R adult males and females and 
mixed-staged y' cn' bw’ sp'. embryos. Sequences are available from the Short 
Read Archive (Accession numbers available in Supplementary Table 10) and the 
modENCODE website (http://www.modencode.org/, Supplementary Table 10). 
CAGE* was sequenced on an Illumina Genome Analyzer IIx with 36-bp reads. 
Poly(A) * seq was generated using a custom protocol (Supplementary Methods). 
Analysis. RNA-seq, CAGE and poly(A) * reads were mapped and filtered’. GRIT 
was used to identify transcript models’*. Expression levels for genes and exons 
were computed in BPKM®. GSC P values were computed”. 'P values were calcu- 
lated with MISO“. Differential expression analysis was conducted with a custom 
method (Supplementary Methods) and with DEseq’’. RPS-BLAST was used to 
conduct the conserved domain search with version v3.08 of the NCBI Conserved 
Domains Database (CDD) (Supplementary Methods). Orthology analysis between 
human and fly was conducted using DIOPT (http://www. flyrnai.org/cgi-bin/DRSC_ 
orthologs.pl). Phenotypic alleles were downloaded from FlyBase 15.50, and were 
selected as any allele localized to the genome with a disease phenotype. 
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Discovering the structure and dynamics of transcriptional regulatory events in the genome with cellular and temporal 
resolution is crucial to understanding the regulatory underpinnings of development and disease. We determined the 
genomic distribution of binding sites for 92 transcription factors and regulatory proteins across multiple stages of 
Caenorhabditis elegans development by performing 241 ChIP-seq (chromatin immunoprecipitation followed by se- 
quencing) experiments. Integration of regulatory binding and cellular-resolution expression data produced a spatio- 
temporally resolved metazoan transcription factor binding map. Using this map, we explore developmental regulatory 
circuits that encode combinatorial logic at the levels of co-binding and co-expression of transcription factors, char- 
acterizing the genomic coverage and clustering of regulatory binding, the binding preferences of, and biological processes 
regulated by, transcription factors, the global transcription factor co-associations and genomic subdomains that suggest 
shared patterns of regulation, and identifying key transcription factors and transcription factor co-associations for fate 


specification of individual lineages and cell types. 


In multicellular organisms, transcription factors bind at cis-regulatory 
elements in the genome to mediate diverse gene expression programs 
with exquisite spatiotemporal control’*. However, owing to the pau- 
city of in vivo developmental stage transcription factor binding data 
and cellular transcription factor expression data, the integrated maps 
required to study transcriptional control of development with spatio- 
temporal resolution are lacking. 

In this work, we analyse regulatory activity ofa broad set of C. elegans 
transcription factors in one or more developmental stages. Exploiting 
recently developed methods**, we integrate transcription factor bind- 
ing data with an initial cellular-resolution map of transcription factor 
expression in the embryo. Our integrated analyses support the discov- 
ery of many key transcription factors and candidate transcription factor 
co-associations for fate specification, providing insights into the tem- 
poral and spatial dynamics of regulatory interactions in development. 


Large-scale survey of regulatory binding 
As part of the modENCODE consortium, we performed 241 ChIP-seq 
experiments to identify in vivo binding sites for 92 (10%) C. elegans tran- 
scription factors and regulatory proteins (collectively termed factors) in 
one or more stages of development or treatments (Fig. 1a and Supplemen- 
tary Table 1). To identify factor binding from the approximately 5.1 billion 
raw reads, we developed a uniform processing pipeline (Extended Data 
Fig. la-e and Methods) that enables comparison of orthologous tran- 
scription factor properties’, such as sequence preferences (Extended 
Data Fig. 1f-h). Eight previously reported® experiments failed to pass 
our quality-control checks and were thus removed from consideration. 
We focused our analyses on embryonic and larval (L1-L4) stages, 
examining a total of 397,539 reproducible binding sites distributed 
across 33,833 binding regions in the genome. Collectively, factor binding 
(excluding RNA polymerases) is spread throughout 21.7% of the C. ele- 
gans genome (Fig. 1b), an upper-bound defined by ChIP-seq resolution”. 


Weestimate that—within our ChIP-seq resolution and sensitivity—we 
have identified approximately 90% of the regulatory binding regions 
(albeit not the majority of binding events; Extended Data Fig. 1i). Con- 
sistent with this estimate, we observe binding within 2 kilobases (kb) 
upstream ofa transcription start site (TSS) for 91.3% of genes (Extended 
Data Fig. 1j)’°. 


HOT regions are dynamic in development 
Previous studies*''"* have revealed regions in metazoan genomes with 
heavily clustered transcription factor binding, termed high-occupancy 
target (HOT) regions. The availability of multiple data sets across stages 
allowed us to examine the dynamics of HOT regions through develop- 
ment. We identified HOT and extreme-occupancy target (XOT) regions 
for each developmental stage, where significant enrichments (false dis- 
covery rate (FDR) <5% and <1%, respectively) in transcription factor 
binding sites are observed (Extended Data Fig. 2a—c). We found a total 
of 9,142 HOT regions (spanning 2,948 genomic regions) in at least one 
developmental stage, and 858 constitutive HOT regions occurring across 
all stages assayed (Fig. 1b, Supplementary Table 2). Constitutive HOT 
regions are enriched in promoters of genes with house-keeping func- 
tions (Extended Data Fig. 2d and Supplementary Table 3). However, 
most HOT regions are dynamic across development: 31-56% of HOT 
regions change between sequential stages and occupancy at larval L4- 
specific HOT regions increases as development progresses (Fig. 1c). 
Across developmental stages, 77-85% of HOT regions occur within 
2 kb upstream of an annotated TSS (Extended Data Fig. 2e). Further- 
more, 88.8% and 88.7% of constitutive HOT regions occur in promoter 
or enhancer states in embryos and L3 larvae, respectively (Extended 
Data Fig. 2f, g). These results indicate that HOT regions reside at important 
regulatory locations (Fig. 1d, Extended Data Fig. 3a—c) in the genome 
and are dynamic during development. 
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Figure 1 | Large-scale regulatory analysis of the C. elegans genome. 

a, Transcription factors (TFs) and regulatory proteins assayed per 
developmental stage (or treatment) in 241 quality-filtered ChIP-seq 
experiments. Stages and treatments are early embryo (EE), late embryo (LE), 
embryo mixed (EM; EE and LE), larval L1 (L1), larval L2 (2), larval L3 (L3), 
larval L4 (L4), young adult (YA), mixed larval and young adults (LY), day 4 
adult (D4), and starved L1 (S1). Embryonic data sets were combined into a 
compiled embryonic stage (EX). Analyses in this report focus on embryonic 
(yellow) and larval (blue) experiments (N = 187). b, Genomic coverage 
(percent of genomic bases) of regulatory binding (excluding RNA polymerases) 
in 181 C. elegans (outer circle) and 339 H. sapiens (inner circle) ChIP-seq 
experiments. Genomic coverage of constitutive HOT (cHOT), HOT, and other 
regulatory binding (RGB) regions are highlighted in red, yellow and blue, 
respectively. Constitutive XOT (cXOT) and XOT percentages are shown in 


Preferences and roles of regulators 


Factors displayed a range of chromatin state’* preferences, with a general 
bias towards promoter and enhancer states (Extended Data Fig. 3d, e). 
Although generally clustered near TSSs, many factors display enrich- 
ments for upstream or downstream binding (Extended Data Fig. 3f). 
Proximal and downstream binders include RNA Pol II (AMA-1) and 
other regulators of transcription initiation and elongation, respectively. 
Upstream binders may be enriched for chromatin remodellers and 
factors that recruit the transcriptional machinery. For example, bind- 
ing of BLMP-1—the orthologue of the human repressor PRDM1 (refs 15, 
16)—is tightly concentrated upstream of TSSs (Fig. le). Likewise, ALY-2, 
a human THOC4 messenger RNA export factor orthologue”, exhibits 
increased binding downstream of TSSs during development (Fig. le) and 
is increased at elongation chromatin states relative to other factors. Gen- 
erally, transcription factors assayed in multiple stages retain their upstream 
and downstream binding preferences. Remarkably, RNA Pol II position- 
ing shifts (Fig. 1f) from a strong elongating distribution in the early em- 
bryo to weaker elongation distributions in later stages, consistent with 
its previously observed continued presence at promoters that are down- 
regulated during development*”’. 

Gene ontology (GO) analysis of the candidate protein-coding gene 
targets revealed 6,347 functional associations (BH-corrected, P < 0.05) 
for 75 factors (Extended Data Fig. 4a and Supplementary Table 4), 
suggesting biological roles for transcription factors of previously un- 
known function. The unstudied factors FKH-10 and C34F6.9 group 
with the established neuronal fate regulators SEM-4 (ZNF236), MAB-5 


parenthesis. CHOT, HOT and RGB region coverage in the human genome are 
0.17%, 1.4% and 6.1%, respectively’. c, Cut-off-normalized occupancy levels in 
126 embryo-specific (yellow) and 91 larval L4-specific (blue) HOT regions. 
Bars indicate the 25th and 75th percentiles. d, Chromatin state, as determined 
in ref. 14, of L3 larvae binding regions by occupancy. RGB-, HOT- and XOT- 
region occupancy levels are indicated along the x axis as blue, yellow and red 
bars, respectively. Poly. Repress. and Heterochr. indicate Polycomb, repressed 
and heterochromatin states. e, f, Signal densities near enzymatically derived 
TSSs” for BLMP-1 and ALY-2, and RNA Pol II. g, Functional (GO term) 
enrichment for gene targets of binding’’. A subset of biological process terms 
(level =4) are shown for factors enriched (Benjamini-Hochberg-corrected, 
P<0.01) in synaptic transmission; early MEP-1 and DPL-1 data sets are 
included for comparison. h, Example signal tracks near the UNC-104 locus. 


(HOXA2 and HOXB2), SAX-3 (ROBO1, ROBO2 and ROBO3), CES-1 
and ZAG-1 in targeting neurotransmission genes, with C34F6.9 addi- 
tionally regulating muscle development and sex differentiation (Fig. 1g). 
Most of these factors, including SEM-4 and C34F6.9 in L2 larvae, appear 
to regulate the neuronal kinesin UNC-104 (human KIF1A, KIFIC, Fig. 1h). 
Although expression of FKH-10 is restricted to six neuronal cells near 
the terminal bulb of the pharynx”, its specific molecular role in neuronal 
regulation and its regulatory targets were heretofore unknown. 

Functional associations also demonstrate malleability of regulation. 
For example, UNC-62 transitions from regulating diverse muscle and 
neuronal development genes in embryos to regulating lipid metabol- 
ism processes in L4 larvae (Extended Data Fig. 4b). These changes are 
consistent with known diverse UNC-62 roles in motor neuron and vulval 
development, as well as locomotion, and ageing””””. Similarly, SAX-3 tran- 
sitions from targeting neuronal genes in L2 larvae to targeting carbohyd- 
rate and lipid metabolism genes in L4 larvae. The change in UNC-62 
regulatory targets coincides with increased expression of the UNC-62 
(7a) isoform in late larval and adult intestine”, which has been shown 
to affect lifespan’*. Such early development regulators may often target 
metabolic regulation in later developmental stages”. 


Global and subdomain transcription factor 
co-associations 

Global analyses of pairwise transcription factor co-associations™ revealed 
a multitude of established and novel co-associations (Fig. 2a), many 
stage-specific clusters of co-association (Extended Data Fig. 5a), as well 
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Figure 2 | Global and domain-specific patterns of transcription factor 
co-association. a, Matrix of global pairwise (i, j) factor co-association strengths 
(N = 17,391) as defined by promoter interval statistics“. Co-association scores are 
scaled by the standard deviation (uncentred) for visualization purposes. Co- 
associations of interest and discussed in the text are highlighted. LX indicates larval 
stages L1-L4. A higher-resolution version is available in Extended Data 

Fig. 10. CES-1-FKH-10 co-associations are highlighted in the inset, top. Co- 
association strengths (unscaled) between early embryo and later stages are shown in 
the inset, bottom, for RNA Pol II-specific binding (blue), and for all factor-specific 
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binding (light blue). b, Embryonic (EX) binding regions (N = 6,555) were 
clustered into a SOM describing 240 co-association patterns among 26 factors. 
c, Binding signatures (fraction of modules bound by each factor) of the learned co- 
association patterns are shown. The relative number of factors per co-association 
pattern, expression from overlapping promoters, distance to TSSs, and number of 
modules with each co-association pattern are indicated as a fraction of the 
maximum observed across co-association patterns. d, Functional enrichment for 
regions with UNC-62-bound co-association patterns of the embryo SOM. 
Molecular function (mf) and biological process (bp) terms are shown. 
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Figure 3 | Stage specificity in higher-order transcription factor 
co-associations. a, Embryonic (EX) and larval L1 binding SOM with raw 
binding sites. Binding data for factors (N = 15) assayed in embryos and L1 
larvae was assigned to 25,261 stage-specific binding modules as shown in the 
inset. Stage-specific binding modules were clustered into an SOM describing 
192 co-association patterns. The SOM is coloured by the embryonic (versus L1) 
stage specificity of the learned co-association patterns, measured as the fraction 
of binding modules that are embryonic. b, Histogram of preceding (T,) versus 
subsequent (T>) stage specificities. 


as differences in co-associations between expressed and repressed promoters 
(Extended Data Fig. 5b). FOS-1-JUN-1 as well as GEI-11-LIN-15B 
co-associations are readily apparent in L1 and L3 larvae, but not in L4 
larvae. Likewise, ELT-3 and BLMP-1, which preferentially reside at 
molting and cuticle development gene promoters, co-localize in L1 
larvae but not in embryos. The neuronal regulators CES-1 and FKH-10 
co-associate across larval stages (L1, L3-L4) but their co-association is not 
apparent in late embryogenesis (Fig. 2a). Changes in co-association are 
often correlated with the presence of additional factors, for example, in 
the embryo to larval L1 transition, the increased ELT-3-BLMP-1 co- 
association is also accompanied by increased GEI-11 co-associations 
with these factors (Extended Data Fig. 5c—f). Other factors remain largely 
invariant through multiple stages, for example, ZTF-11, a human MTF1 
orthologue. 

Functionally related factors were often co-associated. For example, 
FOS-1, NHR-77 and PQM-1 target promoters of genes in cellular lipid 
and ketone metabolic processes. Similarly, EFL-1 and LIN-35, the known 
interacting orthologues ofhuman E2F and RB, showa strong co-association 
in L1 larvae, where they target membrane organization and endocytosis 
genes. 

We observed strong similarities in RNA Pol II binding within embry- 
onic (early and late embryo) and within larval L1-L4 stages, but larval 
RNA Pol II binding is only marginally and weakly co-associated with 
embryonic binding, reflecting the dynamic establishment of the tran- 
scriptional machinery through development (Fig. 2a). 

To uncover higher-order co-associations (involving two or more fac- 
tors), and their genomic subdomains, we applied self-organizing maps 
(SOMs), an unsupervised machine learning technique”’. For each devel- 
opmental stage, we trained SOMs to cluster genomic regions with shared 
transcription factor co-association patterns (Fig. 2b and Extended Data 
Fig. 6a—d), thereby concomitantly identifying transcription factor co- 
association patterns (Fig. 2c) and their target regulatory regions. 

We performed GO analysis of the target regulatory regions for 240, 
390, 439, 390 and 409 clusters in the embryo, larval L1, L2, L3 and L4 
SOMs, respectively, revealing enrichments across 1,209 GO terms (BH- 
corrected, P < 0.05, Extended Data Fig. 6e and Supplementary Table 5). 
As illustrated in the embryo, higher-order co-association patterns show 
a richness of functional associations, with 137 clusters spanning 273 GO 
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terms. A close examination of UNC-62 co-association patterns reveals 
how diverse patterns for individual factors can result in specialized 
functional targeting (Fig. 2d). Regions bound exclusively by UNC-62 
and HLH-1 are highly-enriched at muscle development promoters. In 
contrast, genes targeted by more complex UNC-62 co-associations are 
enriched in synaptic transmission, regulation of cell death, and chro- 
matin assembly functions. Higher-order co-associations are largely stage- 
specific (Fig. 3), a feature modulated by changes in the observed number 
of binding sites for individual factors between stages (Extended Data 
Fig. 7). 


Spatiotemporal transcription factor expression analysis 


Although studies in C. elegans and D. melanogaster have led analyses 
of organismal-level regulatory binding circuits, such studies have gen- 
erally lacked cell-type and tissue resolution. We sought to remedy this 
deficiency by tracking”® the expression of 180 diverse genes (mostly 
transcription factors) through early embryogenesis with cellular resolu- 
tion (Extended Data Fig. 8a—d). Our expression data, from previously 
published** and newly acquired series, includes 36 factors with genome- 
wide binding measurements (13 embryo, 23 larval). 

We observed common and distinctive cellular expression patterns 
amongst a wide distribution of broadly- and narrowly-expressed genes 
(Extended Data Fig. 8e, f). For example, expression of DMD-4, an ortho- 
logue of the vertebrate spinal circuit configuration regulator DMRT3, 
is tightly limited to posterior regions of the pharynx. Similarly, F49E8.2 
expression is exclusive to the Z2 and Z3 germ cells (Extended Data Fig. 8a). 
95.7% of pairs of tracked cells show distinct gene expression signatures 
(R < 0.75). 

Cellular expression mapped the regulatory activity of 16 assayed 
factors to specific tissues (Fig. 4a). As expected, the known regulators of 
pharynx and muscle development, PHA-4 and HLH-1, were respect- 
ively enriched in these tissues. The co-associated factors, MEP-1 and 
DPL-1 (human DP1 and DP2 orthologue), although broadly expressed, are 
enriched in neuronal lineages. This is consistent with the observed MEP-1 
targeting of neuronal function genes in the larvae, and provides further 
support for the coordinate activities of MEP-1 and DPL-1 in targeting 
membrane organization, receptor-mediated endocytosis, and cell-cycle 
genes (Fig. 1f and Supplementary Table 4). 

More complex patterns of co-expression and co-association were 
observed in epidermal tissues, where CEH-16 (human Engrailed), and 
particularly ELT-1 and NHR-25 expression is concentrated. In both 
L2 and L3 larvae, ELT-1 and NHR-25 are modestly co-associated. ELT- 
1 targets transcriptional regulators, including NHR-25, and tail mor- 
phogenesis genes, whereas NHR-25 targets nuclear organization and 
genitalia development genes (Supplementary Table 4). However, larval 
L2 binding of ELT-1 and NHR-25 is co-associated with that of CEH- 
16, whose early embryonic expression is primarily concentrated in a 
subset of pharynx and epidermal cells. 

In early embryo (Fig. 4a) and L1 larvae’*, HLH-1, is primarily and 
broadly expressed in muscle tissues whereas posterior-specific HOX 
factors MAB-5, and EGL-5 are expressed in a small subset of posteriorly 
placed muscle, epidermal and neuronal precursors. We observed modest 
co-association signals between embryonic HLH-1 and MAB-5 binding, 
and larval L3 EGL-5 binding, perhaps reflecting the intersection of 
tissue-specific and positional regulatory programs. As expected, HLH- 
1 targets muscle differentiation genes (together with UNC-62); however 
in GO analysis, we only detect MAB-5 targeting of diverse neuronal 
functions (in mixed embryos and L2 larvae), consistent with its later 
role in neuron specification”. CO5D10.1, whose early embryonic expression 
is also restricted to muscle tissues is not co-associated with the above 
factors, and neither C05D10.1 nor EGL-5 showed specific functional 
associations. Thus, although co-associated factors were often expressed 
in the same tissue, this is not pervasive. Moreover, these co-expression 
patterns are dynamically established during embryogenesis (data not 
shown). 
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Figure 4 | Cell-type and lineage resolution of regulator activity and 
transcription factor co-associations. a, Tissue enrichment (—logio, P value) 
and depletion (log;9, P value) scores for the expressing population of each gene 
are shown (Fisher’s exact, Bonferroni-corrected). Only genes with significant 
enrichments (or depletions) are shown. b, Co-association strength (Fig. 2a) 
versus cellular overlap coefficient for 13 focus factors. The Jaccard index for the 
cellular overlap is indicated for each gene pair by ring size and colour. 

c, Cellular-resolution regulatory binding SOM. Cellular-resolution binding 
modules were generated by annotating in each cell, the binding of focus factors 


Refinement of embryonic co-associations 


Despite extensive studies in metazoan regulatory networks, the rela- 
tionship between regulator binding in overlapping genomic regions 
and co-expression in cell-types is not well studied. We examined this rela- 
tionship among 13 ‘focus’ factors, for which both embryonic binding and 
cellular expression were assayed. This analysis is limited to the first half of 
embryogenesis, where expression was directly measured in 696 ‘focus’ 
cells. Later events may occur that would not be identified in our analysis. 
We found a poor correlation between transcription factor co-expression 
and co-association (R = 0.07, Fig. 4b), consistent with precise coordina- 
tion of these separate processes underlying the differential establishment 
of cell- and lineage-specific regulatory circuits. 

Integrated analysis shows that MEP-1 is co-associated and co-expressed 
with similarly broadly expressed factors (LIN-13, CEH-39) and narrowly 
expressed factors (CES-1, CEH-26), suggesting that MEP-1 often works 
in cis with these additional factors. MEP-1 binding is co-associated with 
CES-1 and CEH-26 in embryos, and expression of these factors is narrowly 
restricted within the MEP-1-expressing population. These MEP-1-—CES-1 
and MEP-1—CEH-26 co-associations are reminiscent of MEC-3-UNC-86 
interactions in which the classic ‘terminal selector’ MEC-3 heterodimerizes 
with the broadly expressed UNC-86 exclusively in touch sensory neurons”. 
Thus, the co-association and co-expression of MEP- 1-CEH-26 suggests 
CEH-26 may function as a terminal selector in head and tail neurons, 
and the excretory cell. The spatiotemporally-resolved co-association 
analyses demonstrate how broadly expressed factors, such as MEP-1 and 
LIN-13—which targets both neurotransmission functions and genitalia 
development—can have diversified functional roles during development 
through co-associations with narrowly expressed factors. 
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expressed in the cell. Cellular-resolution binding modules (inset) were clustered 
into a SOM with 268 learned co-association patterns, 161 (68%) of which were 
discovered in the data. The SOM is colored by the number of factors in the 
learned co-association patterns. d, Tissue classes and co-association signatures 
are shown for 43 co-association patterns with significant enrichments. Tissue 
enrichments of interest are highlighted red. Circle size and colour indicate the 
Jaccard index (%) of cellular overlap. ‘Death’ and ‘Excretory’ represent tissue 
designations for cells with apoptotic and excretory cell fates. 


To determine how co-binding and co-expression co-ordinately define 
regulatory patterning in distinct cell-types and genomic regions, we in- 
tersected cellular expression and binding data by mapping focus factor 
binding to in silico genomes for cells where the factors are expressed. This 
procedure resulted in 4,779,810 binding sites distributed across 2,858,477 
cell-resolved binding modules. We applied an SOM to cluster the cell- 
resolved binding modules by co-association patterns, uncovering 161 tran- 
scription factor co-association patterns and the genomic subdomains and 
specific cellular subsets of the embryo in which they may occur (Fig. 4c). 
The cellular distribution of transcription factor co-association patterns 
revealed co-associations shared among and unique to specific cell fates 
(Fig. 4d). For example, we found that distinct MEP-1, CEH-26 and NHR- 
2 co-associations were specific to neuronal tissues. Similarly, muscle cells 
were enriched in various HLH-1 co-associations. 

We identified 39 co-association patterns whose cellular distribution 
coincides with the cellular expression of at least one of 124 target genes 
(non-focus factors; Bonferroni-corrected, P < 0.01). Focus factor bind- 
ing allowed us to analyse co-association patterns at the promoters of 44 
of these genes (where binding is observed). For 28 (63.6%) of these genes, 
co-association patterns were detected at the promoter and the gene’s cel- 
lular expression matched the cellular distribution. Moreover, the overlap 
between the expression cells for a gene and the co-association cells is 
higher in cases where the co-association occurs in the promoter of the 
gene (Wilcoxon, P= 5.1 X 107°, Extended Data Fig. 8g). This result 
indicates that co-associations at promoters are correlated with cellular 
expression patterns for genes, and suggests a functional regulatory role 
for the discovered co-associations. 
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Discussion 


We have generated a high-coverage transcription factor binding map 
of the C. elegans genome, revealing regulatory targets, co-associations, 
and dynamics across five developmental windows for 92 diverse fac- 
tors. Gene targets suggest a multitude of functional associations for 75 
factors, many previously unannotated and with clear mammalian ho- 
mologues. Our work reveals extensive regulatory rewiring through de- 
velopment, with temporal differentiation of HOT regions in the genome, 
factor positioning preferences, regulatory targets, and co-associations. 

A systematic analysis of transcription factor co-associations through 
development reveals sets of factors that assemble at genomic regions 
associated with more than 1,200 biological functions (GO terms), with 
probable spatiotemporal specificity. As illustrated with UNC-62, these 
higher-order co-associations reveal how individual transcription factors 
can participate in distinct transcription factor co-associations patterns at 
promoters of functionally diverse genes. 

Lastly, cellular-resolution expression tracking allowed us to map the 
activity of 35 factors to precise cell and tissue types, demonstrating lineage- 
specific activities for 16 factors in the early embryo. Importantly, co- 
associations that are observed in whole-organism binding data are not 
always evident at the cellular level, highlighting the need to incorporate 
such information in our understanding of regulatory circuits. As addi- 
tional expression patterns and transcription factor binding sites are deter- 
mined, and methods to track transcription factor binding with cell-type 
specificity are developed, the broader and more precise regulatory logic 
of development should emerge. 


METHODS SUMMARY 


ChIP-seq assays of wild-type (N2) and transgenic nematodes were performed under 
controlled conditions (Extended Data Fig. 9) as described”. Experimental and com- 
putational methods are described in detail in the Methods. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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A type Ia supernova is thought to be a thermonuclear explosion 
of either a single carbon-oxygen white dwarf or a pair of merging 
white dwarfs. The explosion fuses a large amount of radioactive °°Ni 
(refs 1-3). After the explosion, the decay chain from *°Ni to *°Co to 
°°Fe generates y-ray photons, which are reprocessed in the expand- 
ing ejecta and give rise to powerful optical emission. Here we report 
the detection of *°Co lines at energies of 847 and 1,238 kiloelectron- 
volts and a y-ray continuum in the 200-400 kiloelectronvolt band 
from the type Ia supernova 2014J in the nearby galaxy M82. The line 
fluxes suggest that about 0.6 + 0.1 solar masses of radioactive °°Ni 
were synthesized during the explosion. The line broadening gives a 
characteristic mass-weighted ejecta expansion velocity of 10,000 + 
3,000 kilometres per second. The observed y-ray properties are in 
broad agreement with the canonical model of an explosion of a white 
dwarf just massive enough to be unstable to gravitational collapse, 
but do not exclude merger scenarios that fuse comparable amounts 
of °°Ni. 

The detailed physics of the explosion of type Ia supernovae (for exam- 
ple deflagration or detonation) and the evolution*” of a compact object 
towards explosion remain a matter of debate*”. In a majority of models, 
the ejecta are opaque to y-ray lines during first 10-20 days after the ex- 
plosion (because of Compton scattering). At later times, the ejecta become 
progressively more transparent and a large fraction of -rays escapes. 
This leads to a robust prediction” of y-ray emission from type Ia super- 
novae after few tens of days, dominated by the y-ray lines of °°Co. Such 
emission has been observed before: the down-scattered hard X-ray con- 
tinuum from supernova (SN) 1987A in the Large Magellanic Cloud was 
seen halfa year after the explosion'™”, and y-ray lines of °°Co were detected 
several months later’*"*. That was a core-collapse (type II) supernova, 
in which the cause of the explosion is completely different from that of 
type Ia supernovae. Type Ia events, despite being intrinsically brighter, 
are more rare than core-collapse supernovae, and before SN 2014] there 
was not one close enough to detect. The recent type Ia SN 201 lfe ata dis- 
tance of D ~ 6.4 Mpc yielded only an upper limit on the *°Co line flux". 

SN 2014J in M82 was discovered’® on January 21, 2014. The recon- 
structed’’ date of the explosion is January 14.75 uT. This is the nearest 
type Ia supernova to be detected in at least four decades, at the M82 
distance of D ~ 3.5 Mpc (ref. 18). The European Space Agency satellite 
INTEGRAL)” started observing SN 2014J in 2014, from January 31 to 
April 24 and from May 27 to June 26. We use the INTEGRAL data taken 
between days 50 and 100 after the explosion, the period when the ex- 
pected flux from y-ray lines of *°Co is close to the maximum”. This set of 
observations by the SPI and ISGRI/IBIS instruments on board INTEGRAL 
has been analysed, excluding periods of strong solar flares, which cause 
large variations in the instrumental background (Methods and Extended 
Data Fig. 1). The spectrum derived assuming a point source at the 
position of SN 2014] is shown in Fig. 1 using red and blue points for 
SPI and ISGRL, respectively. 


The model spectrum is binned similarly to the observed supernova 
spectrum. The signatures of the 847 and 1,238 keV lines are clearly seen 
in the spectrum (along with tracers of weaker lines of °Co at 511 and 
1,038 keV). The low-energy (<400 keV) part of the SPI spectrum is not 
shown because of possible contamination due to off-diagonal response 
of the instrument to higher-energy lines. At these energies, we use ISGRI/ 
IBIS data instead (Methods). 

By varying the assumed position of the source and repeating the flux- 
fitting procedure using SPI data (Methods) we construct a 40° 40° 
image of the signal-to-noise ratio in the 800-880 and 1,200-1,300 keV 
energy bands (Fig. 2). SN 2014] is detected at 3.9 s.d. and 4.3 s.d. in these 
two bands, respectively. These are the highest peaks in both images. 

The images obtained by ISGRI at lower energies (100-600 keV) dur- 
ing the observations of SN 2014] and in October-December 2013, that 
is, a few months before the SN 2014J explosion (see Methods for the 
details of the earlier observation), are shown in Fig. 3. An inspection of 
images in the 25-50 keV band shows that the fluxes observed in 2013 
and 2014 are similar, whereas at higher energies (>100 keV) there is 
excess at the position of SN 2014] only in the 2014 data. Previous ISGRI 
observations of this field in 2009-2012, with a total exposure of about 
6 X 10° s, revealed no significant signal at energies >50 keV from M82”. 

A combination of imaging and spectral analysis provides robust evi- 
dence of y-ray emission from SN 2014J. As expected, much of the sig- 
nal comes from the 800-900 and 1,200-1,300 keV bands, where two 
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Figure 1 | Gamma-ray lines from Co decay at 847 and 1,238 keV in the 
spectrum of SN 2014J. The spectrum was obtained by INTEGRAL between 
days 50 and 100 after the outburst. Red and blue points show SPI and 
ISGRI/IBIS data, respectively. The flux below 60 keV is dominated by the 
emission of M82. The black curve shows a fiducial model of the supernova 
spectrum for day 75 after the explosion. Inset, lower-energy part of the 
spectrum (black). The expected contributions of three-photon positronium 
annihilation (magenta) and Compton down-scattered emission from 847 
and 1,238 keV lines (green) are also shown. All error bars, 1 s.d. 
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prominent lines of °°Co should be. The best-fitting parameters (flux, 
energy and broadening) of those two lines are given in Extended Data 
Table 1. The fluxes at 847 and 1,238 keV, respectively (2.34 + 0.74) X 
10 “and (2.78 + 0.74) X 10 * photons s ‘cm 7, were corrected for 
the fraction of y-ray photons escaping the ejecta (this fraction is ~60%; 
Methods), yielding an estimate of Mc, = (0.34 + 0.07)Mo@ for the total 
°°Co mass at day 75 (Mo, solar mass). Correcting this value for the 
secular evolution of the *°Co mass in the decay chain °°Ni—> °°Co—> 
°°Fe, provides an estimate of the initial ° Ni mass: My; = (0.61 + 0.13)Mo. 

An independent estimate of the *°Ni mass can be obtained from the 
optical ‘bolometric light curve. The simplest approach is based on the 
assumption”' that the bolometric luminosity at the maximum is approx- 
imately equal to the power of the radioactive decay at this moment. For 
SN 2014J, the maximum bolometric luminosity is ~1.1 X 10*° ergs’, 
attained on day 17.7 after the explosion”, assuming interstellar extinction 
Ay = 1.7 mag in the V filter band. A thorough analysis of extinction” 
yields Ay = 1.85 + 0.11 mag, This implies a °°Ni mass of (0.42 + 0.05)Mo. 
This is marginally consistent with the estimates based on the y-ray emis- 
sion lines, which is not surprising, given the qualitative nature of this 
estimate and the large and complicated extinction in the direction of 
SN 2014J. A more direct test is the comparison of y-ray and bolometric 
optical, ultraviolet and infrared luminosities at day 75. The latter is es- 
timated to be ~1.1 X 10” ergs 1 (Methods). The total energy released 
during decay of the *°Co isotope” is split between neutrinos (~0.8 MeV), 
kinetic energy of positrons (~0.12 MeV) and y-rays (~3.6 MeV). In our 
fiducial model a fraction f ~ 0.77 of the luminosity in y-rays escapes the 
ejecta at day 75. The remaining fraction, 1 - f ~ 0.23, is deposited in the 
ejecta (ignoring bremsstrahlung radiation losses by electrons). Adding 
the kinetic energy of positrons and 23% of the y-ray luminosity pro- 
duced by 0.34M of *°Co yields an estimate of ~1.1 X 10” ergs’ for 


the rate of energy deposition in the ejecta, in good agreement with the 
optical data. The same model predicts that ~3.3 X 10°” ergs | escape 
the ejecta in the form of hard X-rays and y-rays. The observed luminosities 
of the 847 and 1,238 keV lines are ~4.7 X 10°’ and ~8.1 X 10“ ergs, 
respectively. 


4 
1 
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Figure 2 | Signatures of *°Co lines at 847 and 
1,238 keV in SPI images. The broad energy 
bands 800-880 keV and 1,200-1,300 keV are 
expected to contain the flux from °°Co lines, 
accounting for the expected broadening and 
shifting due to the ejecta expansion and opacity 
effects. The source is detected at 3.9 s.d. and 4.3 s.d., 
respectively, in these two bands. Colours show the 
signal-to-noise ratio at a given position. Data 
courtesy of IKI, MPA and the INTEGRAL team. 
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The emergent lines are expected to be broadened and blueshifted be- 
cause of ejecta expansion and the opacity effects (Methods and Extended 
Data Fig. 3). Both effects are indeed observed (Fig. 4). The mean blue- 
shift, averaged over both lines, corresponds to a velocity of Veni = —3,100 
+ 1,100kms7', and the broadening (root mean squared line-of-sight 
velocity) is ¢, = 4,100 + 960 kms° '. These values are broadly consist- 
ent with expectations of our fiducial model (Methods). 

Finally, at lower energies (100-400 keV) the emerging flux is domi- 
nated by Compton scattering of the 847 and 1,238 keV photons and the 
ortho-positronium continuum from positron annihilation (Fig. 1 inset 
and Extended Data Fig. 4). The predicted 100-400 keV flux is consist- 
ent with INTEGRAL data. Roughly half of the signal comes from the 
ortho-positronium continuum, suggesting that at day 75 the positrons 
produced in 19% of the °°Co decay are thermalized in the ejecta and an- 
nihilate via positronium formation. Below 100 keV, the emission declines 
strongly owing to photoabsorption. 

It should be possible to derive stronger constraints from the com- 
bined analysis of the whole spectrum. As a first step, we constructed a 
three-parameter (Myj ejecta mass, My; characteristic velocity, V.) model 
that is capable of reproducing the main observables in the spectrum: 
the y-ray line flux, the line broadening and the continuum flux below 
511 keV (Methods). The model assumes spherical symmetry, complete 
mixing ofall elements over the entire ejecta, and an exponential density 
profile?>?°: poce—*/"e, All three parameters are treated as independent. 
The level of mixing can in principle be determined from the time evo- 
lution of the y-ray flux. For example, early appearance of hard X-ray 
emission from core-collapsed SN 1987A clearly demonstrated that Co 
is mixed”’ over the ejecta. A Monte Carlo code follows the propagation 
of the y-ray photons through the ejecta and accounts for scattering and 
photoabsorption of photons and annihilation of positrons (Methods). 

We vary Myj, Mg; and V, over a wide range, calculate the expected 
emergent spectrum and compare it (in terms of a 7° test) with the 
observed spectrum. This procedure yields the following best-fit para- 
meters and 1 s.d. confidence intervals for the individual parameters: 
Myj = 0.567 § ¢Mo,Mgy = 1.24 93Mo, Ve = 3,000 + 800km s*. In this 


Figure 3 | Appearance of a new hard 

(100-600 keV) X-ray source at the position of 
SN 2014J. In the ISGRI image of the M82 field 
taken in 2013 the source is absent. Colours show 
the signal-to-noise ratio at a given position. 

SN 2014] is detected in this image at ~3.7 s.d. 
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Figure 4 | Broadening of the 847 and 1,238 keV lines. Red points show the 
SPI spectrum in the 720-920 keV range. The red histogram shows the line 
profile in the fiducial model. The blue points show the SPI spectrum of the 
1,238 keV line. For comparison, a Gaussian line at 800 keV with a width 
corresponding to the SPI intrinsic energy resolution is shown with a black line. 
Both observed lines are clearly broadened. The upper axis shows the velocity 
needed to shift the line to a given energy. All error bars, 1 s.d. 


model, the mass-weighted root mean squared velocity of the ejecta is 
~1/12V. =10,000 + 3,000 kms’ (Methods and Extended Data Fig. 5). 
The confidence contours in the Myj-Mg, plane are elongated such that 
the highest and lowest allowed values of Myj respectively correspond 
to the highest and lowest values of My,. 

In more realistic models, based on calculations of explosive nucleo- 
synthesis, the parameters are not independent and the distribution of 
elements over the ejecta can vary strongly. We therefore compared the 
expected spectra for several representative models of type Ia supernovae 
at day 75, scaled to the distance of M82, with the data. The list of models 
and corresponding Ay’ values with respect to the null hypothesis of 
no source are given in Extended Data Table 2. Remarkably, the canon- 
ical model of type Ia supernovae, W7', provides the best description of 
the SN 2014J spectrum, with Ay* = 54.4. The pure-detonation model 
DETO* produces too much °°Ni and can be reliably rejected. The sub- 
Chandrasekhar model HeD6” instead produces a y-ray flux that is too 
low and therefore can also be rejected. Our best-fitting three-parameter 
model (3PAR), and delayed-detonation models DD4*° and DDT 1p 
(E.B. et al., manuscript in preparation), designed to approximately match 
SN 2014] properties in the visual band, offers a gain in Ay* nearly as good 
as W7. From the standpoint of purely statistical errors, W7 performs 
significantly better than other models. However, given the inevitable 
assumptions and simplification in each model, and allowing for pos- 
sible systematic uncertainties, this group of models cannot be rejected. 

Overall, the good agreement with the canonical models shows that in 
y-rays SN 2014] looks like a prototypical type Ia supernova, even though 
strong and complicated extinction in the optical band makes the overall 
analysis challenging. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Observations. INTEGRAL is an ESA scientific mission dedicated to fine spectroscopy 
and imaging of celestial y-ray sources in the energy range from 15 keV to 10 MeV. 
The INTEGRAL data used here were accumulated during revolutions 1391-1407 
(ref. 31), corresponding to the period ~50-100 days after the explosion (proposals: 
1170002, PI: R.S.; 1140011, PI: J.1.; 1170001, public). Observations were performed 
ina standard 5 X 5 pattern around the nominal target location: one source on-axis 
pointing, 24 ‘off-source pointings’, with a 2.17° step. During off-source pointings, 
the source remains well within the INTEGRAL field of view. Periods of very high 
and variable background due to solar flares were omitted from the analysis (Ex- 
tended Data Fig. 1). The total exposure of the clean data set is ~2.6 Ms. 

SPI data analysis. SPI? is a coded-mask germanium spectrometer on board 
INTEGRAL. The instrument consists of 19 individual Ge detectors, and has a field 
of view of ~30° (at zero response), an effective area of ~70 cm? at 0.5 MeV andan 
energy resolution of ~2 keV. The effective angular resolution of SPI is ~2°. During 
SN 2014] observations, 15 out of 19 detectors were operating, resulting in slightly 
reduced sensitivity and imaging capabilities compared with the initial configura- 
tion. The data analysis follows the scheme implemented for the analysis of the Ga- 
lactic Centre positron annihilation emission*’™*. For each detector, a linear relation 
between the energy and the channel number was assumed and calibrated (separately 
for each orbit), using the observed energies of background lines (Extended Data 
Fig. 2) at ~198, 438, 584, 882, 1,764, 1,779, 2,223 and 2,754 keV. These bright lines 
are identified** with known nuclear transitions in Ge and other elements, activated 
by heavy particles, bombarding the SPI detector. For our analysis, we used a combi- 
nation of single and pulse-shape-discriminator (PSD) events”, and treated them in 
the same way. 

The count rate from the supernova at energy E, S(E), and the background rates 
in individual detectors, B;(E, t), were derived from a simple model of the observed 
rates, D,(E, t), in individual SPI detectors, where i is the detector number and t is 
the time of observation with a typical exposure of 2,000 s: D,(E, t) ~ S(E) X R,(E, t) 
+ B,E, t) + C(E). Here R,(E, t) is the effective area for the ith detector, as seen 
from the source position in a given observation, and C;(E) does not depend on time. 
The background rate is assumed to be linearly proportional to the Ge detectors’ 
saturated event rate, Ggat(t), above 8 MeV, averaged over all detectors; that is, 
BE, t) = BAE)Gsat(t). The coefficients S(E), B(E) and CE) are free parameters 
of the model and are obtained by minimizing 7’ for the entire data set. Even though 
the number of counts in individual exposures is low, it is still possible to use a plain 
7 approach as long as the errors are estimated using the mean count rate and the 
total number of counts in the entire data set is large*®. The linear nature of the model 
allows for straightforward estimation of statistical errors. 

As an example, we consider SPI data in the 1,200-1,300 keV range, containing 
the 1,238 keV line of °°Co. For the entire data set there are in total 14,340 flux mea- 
surements by individual SPI detectors with a typical exposure time of ~2,000 s. Fit- 
ting this data set with a constant background model (that is, D;(E, t) ~ C,(E)) yields 
the reduced 7”, = 1.31 per degree of freedom. Adding a B\(E, t) = B(E)Gsa:(t) term 
to the model reduces y’, to 0.998, consistent with expectations for photon counting 
noise. Further adding one more parameter—the flux of a source at the position of 
SN 2014J—lowers 7” by Ay” = 16.4, corresponding toa ~4 s.d. detection. For com- 
parison, for the 600-800 keV band, which does not contain strong lines, the improve- 
ment in 7” by adding the source flux to the model is only 0.60, giving no significant 
evidence for SN 2014J emission. 

Despite its proximity, SN 2014] is still an extremely faint source in y-rays. Al- 
though precise measurements of line fluxes are challenging, a combination of spec- 
tral and imaging information makes our results very robust. To further test the 
possible influence of variable background on our results, we repeated the calcula- 
tion of the SN 2014] spectrum while dropping the B,(E, t) term from the model. The 
line fluxes changed by ~3% compared with our more elaborate baseline background 
model. However, the wings of the lines, where the background is ~10* times higher 
than the source, can still suffer from the residual background features. The uncer- 
tainty in the distance D = 3.53 + 0.26 Mpc (ref. 18) causes an additional ~ 15% un- 
certainty in the flux and, therefore, in the estimated mass of radioactive Ni and Co. 
ISGRI/IBIS data analysis. The primary imaging instrument on board INTEGRAL 
is IBIS*’, which is a coded-mask aperture telescope with the CdTe-based detector 
ISGRI™. It has higher sensitivity to continuum emission than SPI in the 20-300 keV 
range and has a spatial resolution of ~12'. We note here that neither ISGRI nor SPI 
can distinguish the emission of SN 2014J from the emission of any other source in 
M82. ISGRI, however, can easily differentiate between M82 and M81, which are 
separated by ~30’. The energy resolution of ISGRI is ~ 10% at 100 keV. The ISGRI 
energy calibration uses the procedure implemented in OSA 10.0 (ref. 39). The images 
in broad energy bands were reconstructed using a standard mask-detector cross- 
correlation procedure, tuned to produce zero signal on the sky if the count rate 
across the detector matches the pattern expected from pure background, which 
was derived from the same data set by stacking detector images. The noise in the 
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resulting images is fully consistent with the expected level, determined by photon 
counting statistics. The fluxes in broad bands were calibrated using Crab Nebula 
observations with INTEGRAL shortly before the observations of SN 2014J dis- 
cussed here. 

Fitting 847 and 1,238 keV line parameters. The flux, energy centroid and broad- 
ening of the lines were evaluated by fitting a Gaussian to portions of the SPI spec- 
trum in the 800-900 and 1,100-1,350 keV bands. The best-fit parameters are given 
in Extended Data Table 1 along with 1 s.d. uncertainties. Although for plotting pur- 
poses we used heavily binned spectra (Figs 1 and 4), the fitting is done for the un- 
binned SPI spectrum in 0.5 keV-wide channels. 

Because the decay time of 5Co (t= 111.4 days) and branching ratios (1 and 
0.66 for the 847 and 1,238 keV lines, respectively) are known”, it is straightforward 
to convert line fluxes into the mass of *°Co visible to INTEGRAL at the time of 
observation: Mcy = (0.16 + 0.05)Mo and (0.27 + 0.07)Mo for the 847 and 1,238 keV 
lines, respectively. These values can be considered model-independent lower limits 
on the amount of *°Co at day 75 since the explosion. The fractions of line photons 
escaping the ejecta without interactions were estimated from our fiducial model as 
0.60 (847 keV) and 0.64 (1,238 keV). These values were used to correct the observed 
fluxes to derive estimates of the total °°Co mass at day 75, Mco = (0.26 + 0.08)Moa 
and (0.42 + 0.11)Mo, on the basis of the 847 and 1,238 keV line fluxes, respectively. 
The derived masses are consistent within the uncertainties with a mean value of 
Meo = (0.34 + 0.07)M ©. Finally, a correction factor of 1/0.55 has been applied to 
convert the mass of *°Co at day 75 to the initial mass of S°Ni: My; = (0.6 + 0.13) 
Mo. This factor accounts for time evolution of the Co mass in the decay chain 
from Ni to Fe. 

In fully transparent ejecta, the centroid of emerging y-ray lines should be un- 
shifted (at least to first order in V,/c, where c is the speed of light). Opacity sup- 
presses y-rays coming from the receding part of the ejecta, leading to a blueshift of 
the visible line. Blueshift is indeed observed for both lines (Fig. 4). 

The corresponding mean velocity is Vsnif¢ = —3,100 + 1,100 kms ~ 1 This value 
is slightly higher than the expected shift of — 1,280 km s~ | estimated from the fidu- 
cial model for y-ray photons escaping the ejecta without interactions. 

The expected line broadening (root mean squared line-of-sight velocity), o,,, for 
transparent ejecta is directly related to the characteristic expansion velocity. 

In the model with an exponential density profile, ¢,, = 2V.. Indeed, in the model 

we found for directly escaping photons, ¢ = 5,860 kms * ~ 2.1V.. The Gaussian 
fit to the observed lines yields a slightly lower value, 0, = 4,100 + 960km s — l Ttis 
possible that the exact values of the shift and broadening are affected by remaining 
uncertainties in the background model. However, the presence of shift and broad- 
ening at the level of few 10°kms_’ is very robust. 
Simplified model. The emergent y-ray spectrum from a type Ia supernova is deter- 
mined by interactions of y-ray quanta with the expanding ejecta and can be used as 
a proxy for the most basic properties of the supernova’’. Although detailed mod- 
elling of the properties of SN 2014J is beyond the scope of this Letter, we use a 
simple model to qualitatively compare our results with expectations. Our basic ob- 
servables are the line fluxes, primarily determined by My;; line broadening, set by 
the typical energy release per unit mass; and the flux below 511 keV, which is sen- 
sitive to the Thompson depth of the ejecta. Accordingly, we build a spherically sym- 
metric model of homologously expanding ejecta with mass My;, ignoring the possible 
anisotropy of the *°Ni distribution expected in the scenario of white dwarf mergers’. 
The density follows an exponential law*”* poce"/Y*, where V. = \/Ex /6Myj and 
is truncated at v= 10V,, and Ex is the kinetic energy of the ejecta. In this model a 
mass-weighted root-mean-squared velocity of the ejecta is /12V-. All elements, 
including radioactive Ni and Co, are uniformly mixed through the entire ejecta, 
which are composed of iron group elements (58%) and Si and S in equal propor- 
tions (21% each). This is of course a strong simplification, certainly violated in the 
outer regions and in the core, but it allows us to specify the model completely with 
only three parameters: Myj, Mg; and V_. The presence of the radioactive elements 
in the outer layers is crucial for the early phase of the supernova evolution, but 
becomes less important for days 50-100, which are the days of interest here. As a 
fiducial example, we use My; = 0.7Mo, Mg; = 1.38Mo and V, = 2,800 kms~ 1 cor- 
responding to Ex = 1.3 X 10°77 erg. 

A Monte Carlo radiative transfer code is used to calculate the emergent spectrum, 
which includes full treatment of Compton scattering (coherent and incoherent) 
and photoabsorption. Pair production by y-ray photons is neglected. The posi- 
trons produced by B* decay of *°Co (19% of all decays) annihilate in place via pos- 
itronium formation. Both two-photon annihilation into the 511 keV line and the 
ortho-positronium continuum are included. Our reference model was calculated 
for day 75 since the explosion. A time delay due to the finite propagation time of 
the photons is neglected (it amounts to few days from the radius where the bulk of 
the mass is located). 

Examples of the model spectra for days 50, 75 and 100 are shown in Extended 
Data Fig. 3. To a first approximation (justified by the low signal-to-noise ratio of 
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the observed spectrum), the model spectrum for day 75 can be used for compar- 
ison with the observed spectrum, accumulated by INTEGRAL over days 50-100. 
The contributions of various components to the total spectrum are shown in Ex- 
tended Data Fig. 4. The most prominent are the lines at 847 and 1,238 keV. These 
lines are broadened by the expansion velocity of the ejecta and escape without any 
interactions (the role of coherent scattering is negligible at these energies). The line 
shape is also modified by opacity effects, because y-ray photons produced in the 
nearest (approaching) side of the ejecta have a higher chance of reaching an observer. 
This effect causes the line shapes to be skewed towards the blue side. The scattered 
continuum associated with most prominent lines extends down to energies ~ 100 keV. 
At lower energies, photoabsorption becomes dominant and the flux drops. In ad- 
dition, three-photon annihilation, associated with the decay of ortho-positronium, 
produces a significant contribution to the continuum flux. Unlike the scattered con- 
tinuum, this component does not scale with the Thomson depth of the ejecta and 
becomes progressively more important for late evolution of the emergent spectrum. 
With our simple three-parameter model it is possible to run a grid of models to 
evaluate a plausible range of basic parameters. This was done by varying My;, Mg; 
and V,, comparing the model with the spectrum and calculating 7’. It is conveni- 
ent to express the success of the model by the reduction of Ay’ relative to the null 
hypothesis of no source. The null hypothesis gives 7” = 1,948.7 for 1,905 degrees of 
freedom for the combined SPI+ISGRI spectrum. The best-fitting three-parameter 
model has Ay” = 50.5, corresponding to a 7.1s.d. detection with My; = 0.56 Mo, 
Mg; = 1.2Mo and V, = 3,000kms ', corresponding to Ex = 1.3 X 10°! erg. The 
same model allows for calculation of 1 s.d. confidence intervals (for a single para- 
meter of interest) by identifying parameter space, which has Ay” smaller by 1 than 
the best-fitting model. Corresponding confidence intervals are shown in Extended 
Data Fig. 5. 
Specific explosion models. Apart from our simplified three-parameter model, we 
compare the spectrum (SPI+ISGRI) with the expected spectra calculated (E.B. 
et al., manuscript in preparation) for several detailed type Ia supernova explosion 
models. For each model, we calculate the reduction of A ¢ relative to the null hy- 
pothesis of no source. The resulting values and basic characteristics of the models 
are given in Extended Data Table 2. The set of models includes the classic W7 (ref. 1) 
and DD4 (ref. 30) models, the pure-detonation model, DETO”*, the sub- 
Chandrasekhar model, HED6 (ref. 29), our three-parameter model 3PAR with fi- 
ducial and best-fitting parameters, and several variants of the delayed-detonation 
model DDT 1p (E.B. et al., manuscript in preparation). DDT 1p4 was built to match 
SN 2014] in the visible band. DDT 1p1 isa slightly less energetic version of DDT 1p4. 
In the DDT 1p4halo model, the object is surrounded by a 0.2Mo ‘halo’, which can 
be envisaged in a slow merger scenario. 


Although the single-white-dwarf models discussed above provide consistent 
descriptions of the INTEGRAL data, these observations by themselves do not 
immediately exclude double-white-dwarf merger scenarios, provided that similar 
amounts of *°Ni are synthesized. In this regard, we note that no direct evidence 
that the progenitor of SN 2014] was a single accreting white dwarf has been found 
so far, placing tight limits on the most popular accretion scenarios”. 
Optical bolometric luminosity during INTEGRAL observations. Using recent 
BVRIJHK photometry*’, we estimated the supernova optical bolometric luminosity 
on day 75 after explosion (median of INTEGRAL observations). For Ay = 1.85 mag 
and Ry = 1.6, we found the bolometric luminosity of ~1.1X< 10” erg s_}, which 
agrees well with the estimates of deposited power in our fiducial model. 
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Extended Data Figure 1 | Variations in the particle background during lasts about 3 days. Periods of very high and variable background (shown in 
INTEGRAL observations. Anti-coincidence-system count rate is shown asa __ blue) due to solar flares were omitted from the analysis. Periods of quiescent 
function of time, expressed through the revolution number. One revolution background (red) were used to derive the spectrum of SN 2014J. 
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Extended Data Figure 2 | Comparison of the SPI background spectrum and 
the expected type Ia supernova emission. Typical quiescent background 
(black) and supernova model (red, convolved with SPI energy resolution) 
spectra. 
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Extended Data Figure 3 | Predicted spectra for days 50, 75 and 100 after 
explosion. The 3PAR model spectrum calculated for day 75 is used for 
comparison with the INTEGRAL data obtained between day 50 and 100 since 
the explosion. Weak lines below 200 keV correspond to *°Ni (day 50) and °”Co 
(day 100). 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


10-6 


ie 
° 
& 


Phot s-! em-? keV-! 


co) 
4 


10-8 


100 1000 
Energy, keV 


Extended Data Figure 4 | Contributions of various components to the 
model spectrum. The lines are formed by y-rays escaping the ejecta without 
interactions. The low-energy tail of each line is due to Compton down- 
scattering of the photons because of the recoil effect. The ‘humps’ in the tails 
correspond to the scattering by 180°. The magenta line shows the contribution 
of the ortho-positronium annihilation. Annihilation of para-positronium 
contributes to 511 keV line. 
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Extended Data Figure 5 | Confidence contours for our three-parameter model. The cross shows the best-fit values. Contours are plotted at Ay” = 1 with respect 
to the best-fit value and characterize the 1 s.d. confidence interval for a single parameter of interest. 
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Extended Data Table 1 | Parameters of the observed gamma-ray lines of °°Co 


Parameter 

Flux, 10° phot cm’ s" 
Luminosity, 10" erg s" 
Venit (l.0.s. velocity), km s! 


Line width (I.0.s. velocity rms), oy, km s' 


847 keV line 


2.3440.74 


4.7 


-1900+1600 


360021300 


1238 keV line 


2.7840.74 


8.1 


-4300+1600 


470021400 


Each line is independently approximated with a Gaussian. Errors, 1 s.d. 
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Extended Data Table 2 | Comparison of typical type la supernova explosion models with the data 


Model Mni, Moun Me;, Msun Ex, 10°" erg Ay? 
w7' 0.59 1.38 1.24 54.4 
DDT1p1 0.54 1.36 1.29 52.5 
pp4* 0.61 1.39 1.24 52.0 
DDT1p4 0.66 1.36 1.35 51.9 
3PAR, best-fitting 0.56 1.20 1.3 50.5 
3PAR, fiducial 0.70 1.38 13 49.3 
DDT1p4halo 0.62 1.55 1.3 49.1 
HED6” 0.26 0.77 0.72 38.2 
DETO” 1.16 1.38 1.44 12.1 
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The rightmost column shows the improvement in 7’, relative to the null hypothesis of no source, by assuming a type la supernova spectral model (no free parameters). The null hypothesis gives 7* = 1,948.7 for 


1,905 degrees of freedom. 
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Quantum imaging with undetected photons 


Gabriela Barreto Lemos’, Victoria Borish’*, Garrett D. Cole’, Sven Ramelow!?+, Radek Lapkiewicz!? & Anton Zeilinger** 


Information is central to quantum mechanics. In particular, quantum 
interference occurs only if there exists no information to distinguish 
between the superposed states. The mere possibility of obtaining 
information that could distinguish between overlapping states inhi- 
bits quantum interference’”. Here we introduce and experimentally 
demonstrate a quantum imaging concept based on induced coher- 
ence without induced emission**. Our experiment uses two separate 
down-conversion nonlinear crystals (numbered NL1 and NL2), each 
illuminated by the same pump laser, creating one pair of photons 
(denoted idler and signal). If the photon pair is created in NL1, one 
photon (the idler) passes through the object to be imaged and is over- 
lapped with the idler amplitude created in NL2, its source thus being 
undefined. Interference of the signal amplitudes coming from the 
two crystals then reveals the image of the object. The photons that 
pass through the imaged object (idler photons from NL1) are never 
detected, while we obtain images exclusively with the signal photons 
(from NL1 and NL2), which do not interact with the object. Our ex- 
periment is fundamentally different from previous quantum imag- 
ing techniques, such as interaction-free imaging’ or ghost imaging®”®, 
because now the photons used to illuminate the object do not have 
to be detected at all and no coincidence detection is necessary. This 
enables the probe wavelength to be chosen in a range for which suit- 
able detectors are not available. To illustrate this, we show images of 
objects that are either opaque or invisible to the detected photons. 
Our experiment is a prototype in quantum information—knowledge 
can be extracted by, and about, a photon that is never detected. 

The conceptual arrangement of our imaging technique, based on a 
quantum interference experiment** by Zou, Wang and Mandel, is illus- 
trated in Fig. 1. A pump beam (green) divided by a 50:50 beam splitter 
(BS1) coherently illuminates two identical nonlinear crystals, NL1 and 
NL2, where pairs of collinear photons called signal (yellow) and idler 
(red) can be created (|c)|d) in NL1 and |e)|f) in NL2). The idler am- 
plitude created in NLI reflects at the dichroic mirror D1 into spatial mode 
d, and signal amplitude passes into spatial mode c. The idler passes 
through the object O of real transmittance coefficient T and phase shift 
y | c). | d), Te!’ | C). | d),+V1—T? | C)s | w),, where for simplicity we lump 
all lost idler amplitude into a single state |w); (here subscripts s and i 
represent signal and idler). By reflection at dichroic mirror D2, the 
idler from NLI aligns perfectly with idler amplitude produced at NL2, 
|d), > |f);. The state at the grey dotted line is thus 


1 iy 
[tele + ed NA+ VI 1/4) (1) 
The idler is now reflected at the dichroic mirror D3 and discarded. The 
signal states | c), and | e) , are combined at the 50:50 beam splitter BS2. The 
detection probabilities at the outputs |g), and |h),, obtained by ignoring 
(tracing out) the idler modes, are 


1 
Poin 5 LET cosy] (2) 


Thus, fringes with visibility T can be seen at either output, even though the 
signals combined at BS2 have different sources*’”. These fringes appear in 


the signal single photon counts; the idlers are not detected. No coincidence 
detection is required. 

The peculiar feature of this interferometer is that no detected photon 
has taken path d. Yet, in our experiment, it is precisely here where we put 
the object to be imaged. The key to this experiment is how the signal- 
source information carried by the undetected idler photon depends on 
T. For, if T = 0, an idler detected after D3, coincident with a signal count 
at | g), or |h) ,» would imply the signal source was NL2. Detection of a 
signal photon without a coincident idler would imply the signal source 
was NL1. This which-source information destroys interference because 
it makes the quantum states overlapping at BS2 distinguishable. If 
T = 1, the idler photon carries no which-source information. The sig- 
nal states overlapped at each output of BS2 are then indistinguishable; 
thus the interference term in equation (2) appears. The above arguments 
are valid even though the idler photons are not detected, for it is only 
the possibility of obtaining which-source information that matters in 
this experiment. 

Our experiment has a connection to interaction-free measurements 
Note that P;, = 0 if no object is placed in the set-up (T = 1 and y = 0). 
Now insert an opaque object (T= 0) so that P;, > 0, and monitor the 
idler reflection from D3. Coincident counts in | h), and the idler detector 
reveal that the object is present even though no photon interacted with 
the object. With our set-up it is thus possible to realize non-degenerate 
interaction-free imaging. 

With O and D2 removed, equation (1) would be an ordinary two- 
particle entanglement", |c),|d); + |e),|f);. With them in, |d); > Te'’| f);, 
which creates equation (1). A normal two-particle entanglement has 
changed into an interesting single-particle superposition, which is es- 
pecially rich when T and y are transverse-position dependent. 

We expand the conceptual arrangement of Fig. 1 into an imaging 
system (Fig. 2). We replace the photon counters with cameras sensitive 
to single photons and the uniform object with one bearing features, that 
is, T = T(x, y) and y = (x, y) depend on transverse position (x, y). Our 
source produces spatially entangled photon pairs'*’’. Sharp spatial cor- 
relations between signal and idler in the object plane and confocal lens 
systems'° (see Methods) guarantee a point-by-point correspondence bet- 
ween the object plane and the detector surface on the camera. 

The intensity image (non-constant transmittance) is due to transverse- 
position-dependent which-source information carried by the undetected 
idler photons. The phase image is of a different nature: it is due to the fact 
that the position-dependent phase shift on the idler photons in path d is 
actually passed to the signal; that is”, |c) , (Te’” |f):) + | é). lf); = (Te | Che 
+ |e),)|f);-Remarkably, the idler beam | f); alone does not even carry the 
phase pattern, and without detection in coincidence it could not be used 
to obtain the phase image’*”. 

We will now show images obtained by detecting 810-nm photons 
with a camera capable of single-photon sensitivity at this wavelength, 
when three different objects are illuminated by 1,550-nm photons, to 
which our camera is blind (see Methods). First, a cardboard cut-out 
placed into the path D1-D2 is imaged. Next, we show that a position- 
dependent phase shift produces an image even when the object is opaque 
(an etched silicon plate) or invisible (etched silica plate) at the detection 
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Figure 1 | Schematic of the experiment. Laser light (green) splits at beam 
splitter BS1 into modes a and b. Beam a pumps nonlinear crystal NL1, where 
collinear down-conversion may produce a pair of photons of different 
wavelengths called signal (yellow) and idler (red). After passing through the 
object O, the idler reflects at dichroic mirror D2 to align with the idler produced 
in NL2, such that the final emerging idler |f); does not contain any information 
about which crystal produced the photon pair. Therefore, signals |c), and 

|e), combined at beam splitter BS2 interfere. Consequently, signal beams |g), 
and |h), reveal idler transmission properties of object O. 


wavelength. The images obtained with an electron multiplying charge 
coupled device (EMCCD) camera show single (non-heralded) counts 
per pixel (16 lum X 16 um) obtained in an exposure time of 0.5 s with an 
electron multiplying gain factor of 20. The visibility achieved in the 
experiment is 77% (see Methods for details). 

Figure 3a shows the beamsplitter output when a cardboard cut-out 
(illustrated in Fig. 3b) is inserted in the path D1-D2. Constructive inter- 
ference is seen at one output of the beam splitter and destructive inter- 
ference is observed in the other output. Interference only occurs in the 
region corresponding to the idler beam transmitted through the shape 
cut out of the cardboard, as seen in the sum and difference of the com- 
plementary images, shown in Fig. 3c and d, respectively. The sum of the 
two outputs of the beamsplitter gives the featureless intensity profile of 
the signal beams, demonstrating that the signal beams, while carrying the 
intensity information, are not absorbed at all by the mask. 

In Fig. 4a, we show the image of an etched 500-m-thick silicon plate; 
the plate is shown in Fig. 4b (see Methods section for details of the silicon 
plate and the etching process). Silicon is opaque to illumination at 810 nm, 
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Figure 2 | Experimental set-up. A continuous-wave 532-nm laser (green) 
illuminates crystals NL1 and NL2. Wave plates (WPs) adjust the relative phase 
and intensity of the outputs of the polarizing beam splitter (PBS). The dichroic 
mirror D1 separates down-converted 810-nm (yellow) and 1,550-nm (red) 
photons. The 1,550-nm photons are transmitted through the object O and sent 
through NL2 by dichroic mirror D2. Lenses image plane 1 onto plane 3, and 
plane 2 onto the EMCCD camera. A 50:50 beam splitter (BS) combines the 
810-nm beams. Dichroic mirrors D1, D2, D4 and D5 transmit the pump. 
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Figure 3 | Intensity imaging. a, Inside the cat, constructive and destructive 
interference are observed at the outputs of BS when we placed the cardboard 
cut-out shown in b in the path D1-D2. Outside the cat, idler photons from 
NL1 are blocked and therefore the signals do not interfere. c, The sum of the 
outputs gives the intensity profile of the signal beams. d, The subtraction of 
the outputs leads to an enhancement of the interference contrast, as it highlights 
the difference between constructive and destructive interference. 


thus it is impossible to realize transmission imaging by illuminating the 
silicon with 810-nm photons. However, silicon is highly transparent at 
1,550 nm and when we place the object in path D1-D2, the difference in 
optical path length for the etched and non-etched regions corresponds to 
a relative phase shift of m. Even though our camera is blind to 1,550-nm 
light, the image is seen by detecting 810-nm photons at the output of BS2 
(Fig. 4a). 

Finally, Fig. 5a shows the image of a fused silica (SiO) plate etched 
with a pattern that is invisible at the detection wavelength (details are 
given in the Methods section). We take advantage of the flexibility of our 
source to obtain collinear non-degenerate down-conversion at 820 nm 
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Pixels 
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Figure 4 | Phase image of an object opaque to 810-nm light. a, Detection of 
810-nm photons at both outputs of BS when a silicon plate (opaque to 810-nm 
light) with a 3-mm-tall etched cat (b) was introduced in path D1-D2. b, Three- 
dimensional rendering of the etch design overlaid with stylus profilometer 
scans (blue points) of the actual etch depth. 
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Figure 5 | Phase imaging of a 2m step at 820 nm. a, The top picture was taken 
with the object (shown in b) placed in the 820-nm beam between L4 and L4’; 
in the bottom picture, the object was placed in the 1,515-nm beam in path 
D1-D2. b, Three-dimensional rendering of the design overlaid with stylus 
profilometer scans (blue dots) of the actual etch depth. 


and 1,515 nm (see Methods). The object (Fig. 5b) has an etch depth of 
1,803 nm, imparting a relative phase shift of ~27 for 820-nm light. Thus 
the object is invisible when placed in the path of the detected photons 
between L4 and L4’ (top of Fig. 5a). This same etch depth gives an ~1 
phase step for 1,515-nm light, so when this same object is placed in the 
path D1-D2 of undetected photons, an image seen in the contrast of 
constructive to destructive interference is retrieved in the 820-nm out- 
puts (bottom of Fig. 5a). 

In summary, we have presented a quantum system for intensity and 
phase imaging where the photons that illuminate the object are not 
detected and the photons that are detected do not illuminate the object. 
We image objects that are either opaque or invisible at the detection 
wavelength (near-infrared) by illuminating three different objects with a 
wavelength to which our detector is blind. This experiment is funda- 
mentally different to ghost imaging*” as it relies on single-photon inter- 
ference and does not require coincidence detection. Furthermore, our 
technique could be used for non-degenerate interaction-free imaging, 
with potential applications spanning biological imaging to the inspec- 
tion of integrated circuits. Our system can realize grey-scale intensity 
or phase imaging, and it can be modified in order to measure spectral 
features (spectral imaging)”. 

We have demonstrated that our technique does not require the laser or 
the detector to function at the same wavelength as that of the light probing 
the object. Additionally, any nonlinear process can be used as a source, and 
this provides flexibility in the wavelength range for both detection and 
illumination of the object. In particular, in spontaneous parametric down- 
conversion (as used here), the only absolute restriction is that the sum of 
the two photon energies equals that of the pump photons. We have shown 
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that information can be obtained about an object without detecting the 
photons that interacted with the object. Knowing the two-photon state, 
one can obtain information about an object. It has not escaped our atten- 
tion that, on the other hand, by knowing the object, one could obtain 
information about the quantum state without detecting it. 


METHODS SUMMARY 


A detailed schematic of our imaging set-up is shown in Fig. 2. A 532-nm linearly 
polarized Gaussian pump laser beam focused by lens L1 on plane 1 is divided at a 
polarizing beam splitter (PBS) and coherently illuminates two identical periodically 
poled potassium titanyl phosphate (ppKTP) crystals, NL1 and NL2. The PBS plus 
wave plates (WPs) are used to control the relative amplitudes and phases between the 
reflected and transmitted pump beams. With an extra half-wave plate (HWP) in the 
reflected beam, both beams have the same polarization. The 1,550-nm idler ampli- 
tude produced at NL1 is reflected by dichroic mirror D1, through which the 810-nm 
signal and the pump are transmitted. Dichroic mirror D4 transmits 532-nm light 
and reflects 810-nm light. A long-pass filter (not shown in the figure) placed directly 
before the object O blocks any residual 532-nm or 810-nm light. The 1,550-nm 
amplitude from NL1 illuminates the object O and is then overlapped with the pump 
beam at dichroic mirror D2 that transmits 532-nm light and reflects 1,550-nm light. 

Lens pairs L2-L2’, L3-L3', and L4-L4' image plane 1 onto plane 3, thereby ensur- 
ing that pump, idler and signal, respectively, are identical in these planes, thus con- 
tributing to obtain high interference visibility”! (see Methods). Lenses L5 and L6 
together with L3’ and L4’ image object plane 2 onto the camera surface. 

The 810 + 1.5 nm photons are detected (without heralding) in both outputs of the 
BS using an EMCCD camera that exhibits single-photon sensitivity at 810 nm, but 
has a negligible response at 1,550 nm. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Carbon-carbon bond cleavage and rearrangement of 
benzene by a trinuclear titanium hydride 


Shaowei Hu!, Takanori Shima’? & Zhaomin Hou)? 


The cleavage of carbon—carbon (C—C) bonds by transition metals 
is of great interest, especially as this transformation can be used to 
produce fuels and other industrially important chemicals from nat- 
ural resources such as petroleum and biomass. Carbon— carbon bonds 
are quite stable and are consequently unreactive under many reaction 
conditions. In the industrial naphtha hydrocracking process, the aro- 
matic carbon skeleton of benzene can be transformed to methylcy- 
clopentane and acyclic saturated hydrocarbons through C—C bond 
cleavage and rearrangement on the surfaces of solid catalysts’ °. How- 
ever, these chemical transformations usually require high temperatures 
and are fairly non-selective. Microorganisms can degrade aromatic 
compounds under ambient conditions, but the mechanistic details 
are not known and are difficult to mimic’. Several transition metal 
complexes have been reported to cleave C—C bonds in a selective fash- 
ion in special circumstances, such as relief of ring strain, formation of 
an aromatic system, chelation-assisted cyclometallation and f-carbon 
elimination® °. However, the cleavage of benzene by a transition metal 
complex has not been reported’* ’. Here we report the C—C bond 
cleavage and rearrangement of benzene by a trinuclear titanium poly- 
hydride complex. The benzene ring is transformed sequentially to a 
methylcyclopentenyl and a 2-methylpentenyl species through the 
cleavage of the aromatic carbon skeleton at the multi-titanium sites. 
Our results suggest that multinuclear titanium hydrides could serve 
as a unique platform for the activation of aromatic molecules, and 
may facilitate the design of new catalysts for the transformation of 
inactive aromatics. 

The investigation of the reactivity of multimetallic complexes with 
benzene is of great interest and importance, in view of the fact that the 
activation of aromatic compounds on industrial solid catalysts might rely 
on the cooperation of multiple metal sites’*”°*". In our recent studies 
on multinuclear rare-earth and transition metal hydride complexes, we 
found that the multiple metal—hydride sites could have unique syner- 
gistic effects on the activation of small molecules such as CO, CO2, Hz 
and N, (refs 22-26). In particular, the unusually high reactivity of the 
trinuclear titanium heptahydride complex [(C;Me,SiMe3) Ti] 3([13-H) 
(t-H)¢ (1)”° motivated us to examine carefully whether the hydride clus- 
ter could activate other chemically inactive molecules such as benzene. 

When a benzene solution of 1 was left to stand at room temperature 
(22 °C) under an argon atmosphere for a few days, a methylcyclopen- 
tenyl complex [(C;Me,SiMe3)Ti]3(Cs;H,Me)(1-H),4 (2) was formed as 
dark green crystals. Complex 2 could be obtained almost quantitatively, 
when a benzene solution of 1 was kept at 40 °C for 36h (Fig. 1a). An 
X-ray diffraction study revealed that 2 contains one methylcyclopen- 
tenyl (MeC;H,) moiety, three (C;Me,SiMe;)Ti units and four hydride 
ligands (Fig. 1a). The methylcyclopentenyl moiety formally bears three 
negative charges, one being distributed among C39, C40 and C41 to form 
an allylic moiety and two being placed on C37 and C38. Four (C37, C38, 
C39, C41) of the five ring carbon atoms in the [MeC;H,]* unit are 
u-n*-bonded to the Tis triangle. 

Remarkably, here a benzene molecule was partly hydrogenated and 
ring-contracted to a five-membered ring species, [MeC;H,]. This reaction 


required breaking a robust aromatic C—C bond of benzene and mak- 
ing anew C—C bond. In this transformation, three of the seven hydride 
ligands in 1 were consumed, one being transferred to benzene and two 
released as H2 by donating two electrons, thus affording the trianionic 
[MeC;H,]*~ species. The formal oxidation state of the Tiatoms in 1 and 
2 remained unchanged; both complexes formally contain one Ti(Iv) and 
two Ti(im) atoms”®. 

When °C-enriched benzene '*C,Hg was used to react with 1 under 
similar conditions, the corresponding '*C-enriched methylcyclopentenyl 
complex, [(C;Me,SiMe;)Ti] 3(°°Cs5H4'°CH3)(1-H)4 (2-77), was obtained 
(Supplementary Information), confirming that the methylcyclopente- 
nyl unit in 2 was indeed derived from benzene. The reaction of 1 with 
benzene-d, (C,D¢) afforded [(C;Me,SiMe3)Ti]3[C5;Z4CZ3] (U-Z)4 (Z = 
Hp7-0.5D0.3-0.5) (2-HD), in which H and D atoms are randomly distrib- 
uted between the methylcyclopenteny] unit and the titanium framework 
as shown by 'H and *H NMR spectroscopy. 

To isolate a possible reaction intermediate, we kept a benzene solution 
of 1 at a lower temperature (10 °C) for about two days. Evaporation of 
the solvent under vacuum yielded dark-green solids, which appeared to 
be a mixture of 2 (25%) and a new compound, [(C;Me,SiMe;)Ti]3(U- 
1°-CeH,)(1-H)3 (4) (75%), as shown by ‘HNMR spectroscopy. Although 
complete separation of 2 and 4 was difficult, a single crystal of 4 suit- 
able for X-ray diffraction studies was obtained by recrystallization in 
tetrahydrofuran (THF). We found that 4 is formally composed ofa tetra- 
anionic [CsH.]*_ moiety and a tetra-cationic titanium trihydride unit, 
[Cp’;Ti;H;]** (Fig. la). The [CsH,] unit is u-n®-bonded to the Ti; frame- 
work. The 'H NMR spectrum of 4 exhibited a singlet at a chemical shift 
of 6 = 4.88 p.p.m. (tetramethylsilane reference) for the six protons of the 
[CsHg] unit in a temperature range of 22 °C to -80 °C, showing that the 
structure is highly fluxional. 

Complex 4 decomposed slowly to unidentified products at room tem- 
perature. The direct transformation of 4 to 2 was not observed. When 4 
was exposed to an atmosphere of H2 in THF-dg, the immediate forma- 
tion of another new compoundassignable to [(C;Me,SiMe3)Ti]3(CsH7) 
(1-H), (3) was observed (Supplementary Information; see also 6 in Fig. 2, 
and below). In this reaction, the [C;H,]* unit in 4 was formally hydro- 
genated (or protonated) to [CsH,]*~. 

When kept at 40 °C for about 3 h, 3 was transformed quantitatively 
to the methylcyclopentenyl [MeCsH,]*~ complex 2 through ring con- 
traction of the [CsH7]*~ unit (Fig. 1a). This transformation occurred even 
at room temperature, albeit slower. Kinetic studies on the transformation 
of 3 to 2 revealed the activation parameters of AH” = 21.7(6) kcal mol! 
and AS” = —4.7(19) entropy units (Supplementary Information). The 
entropy of activation is in agreement with a unimolecular process. In 
the absence of H2, 3 rapidly changed to 4 by release of H2, demonstrat- 
ing that 3 and 4 are easily interconvertible through hydrogenation/ 
dehydrogenation. 

To gain more information on the reaction process, we monitored the 
reaction of 1 with benzene-d, (C,D¢) at 22 °C by 'H NMR spectros- 
copy. With the decrease of 1, gradual formation of 2-HD was observed 
(Fig. 1b). In this process, simultaneous formation of two species assignable 
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Figure 1 | Reactions ofa trinuclear titanium heptahydride complex (1) with 
benzene and benzene-d,. a, Reaction of 1 with benzene (Cp’ = C;Me,SiMes). 
The X-ray core structures of 2, 4 and 5 are shown as ball and stick diagrams 


to 3-HD and 4-HD was also observed. After 78 h, 3-HD, 4-HD and 1 
all disappeared to yield 2-HD almost quantitatively. 

The above experimental results suggest that 3 may be first formed by 
the reductive hydrogenation of benzene by the heptahydride complex 
1 through hydride transfer and H, elimination (Fig. 1a). There is an equi- 
librium between 3 and 4 through dehydrogenation and rehydrogenation. 
The isomerization (ring contraction) of the [CgH,] unit in 3 would finally 
yield the methylcyclopentenyl complex 2. 

When 2 was heated at 100 °C for two days in a sealed Schlenk tube, 
insertion ofa Ti atom into a C-C bond of the methylcyclopentenyl rin; 
took place, yielding a titanacycle product, [(C;Me,SiMe3)Ti]3[ u-n?n, 
1°-CHC(Me)(CH)s] (1-H) (5), as a major product (about 75%) (Fig. 1a). 
This transformation was accompanied by liberation of H2. Heating 
2-'3C under the same conditions (in benzene) afforded the corresponding 


(Cp’ ligands omitted for clarity). Me, methyl. b, Conversion versus time for the 
reaction of 1 with benzene-d, at 22 °C. The solid lines are interpolations. 


*C-enriched complex, 5-'°C, suggesting that this rearrangement should 
bea unimolecular process. Complex 5 could formally be viewed as a com- 
bination ofa penta-anionic 2-methylpentenyl group, [CHC(Me)(CH)5] 7 
and a penta-cationic titanium dihydride unit, [(C;Me,SiMe;) 3Tiz3H)]°. 
An X-ray diffraction study established that one (C;Me,SiMe3)Ti unit is 
incorporated into the 2-methylpentenyl framework to form a planar, six- 
membered titanacycle. Each side of the metallacycle plane is 1°-bonded 
toa (C,Me,SiMe;)Ti unit, leading to formation of a double stacker struc- 
ture. There are two hydride ligands in 5, each bridging the Ti atom in 
the metallacycle and one Ti atom outside of the metallacycle. 

For comparison, we also examined the reaction of 1 with toluene. When 
a toluene solution of 1 was kept at 20 °C for 42 h, the partial hydrogenation 
of toluene took place to give [(C;Me,SiMe3) Ti] 3(CgH¢Me) (1-H), (6), 
which contains a [Cg;HgMe]*” unit and could be viewed as an analogue 
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8 (62% NMR yield) 


9 (33% NMR yield) 


of 3 formed in the reaction of 1 with benzene (Fig. 2; see also Fig. 1a). A 
single crystal of 6 suitable for X-ray diffraction studies was obtained by 
recrystallization under an atmosphere of H, in hexane. It was revealed 
that there is an agostic interaction between a C-H bond in the CsHsMe 
unit, which is formed by hydrogenation of toluene at the ipso-position, 
and a Ti atom (Ti2) (Ti2-C10: 2.326(3) A; Ti2-H10: 2. 11(3) A). There 
are four hydride ligands in 6, one capping the Ti; triangle in a [13-fashion 
and three bridging the Ti: * Ti sides in a [1.-mode, analogously to complex 
2 (Figs 2 and 1a). In agreement with the solid structure, the ipso-proton in 
the [CsHgMe] unit in 6 exhibited a singlet at a high field 6 = -2.23 p.p.m. 
in the ‘H NMR spectrum, showing that an agostic Ti:**H interaction 
remained in solution. The '*C-enriched analogue 6-'*C, prepared by 
the reaction of 1 with °C-enriched toluene (1?C;H;'*CH;), showed a 
doublet at 6 = -2.23 p.p.m. with a relatively small '*C-'H coupling con- 
stant (Jc_4 = 108 Hz) in the ‘H NMR spectrum, thus providing further 
proof for the presence of an agostic Ti: *:H interaction (for comparison, 
normally Jc_y = 120-130 Hz; ref. 27). 

In the absence of H2, the dehydrogenation of 6 took place rapidly to 
give [(C;Me,SiMe3)Ti]3(CeHsMe) (1-H); (7) (Fig. 2), an analogue of 4 
formed in the case of benzene. When 6 (or 1 plus toluene) was heated 
in a THF-dg solution at 60 °C for 36 h, the six-membered ring metalla- 
cycle products [(C;Me,SiMes)Til]3[ u-17,1°,1°-CHC(Me)(CH).C(Me)] 
(1-H), (8) (62% NMR yield) and [(C;Me,SiMe;)Ti]3[u-1,1,1-CHC 
(Me)(CH)C(Me)CH](t1-H)2 (9) (33% NMR yield), which are analogues 
of 5 formed in the case of benzene, were obtained. The formation of the 
two regioisomers 8 and 9 suggests that a dimethylcyclopentenyl species 
such as X (an analogue of 2; Fig. 2) should be an intermediate in this reac- 
tion. But in contrast to the reaction of benzene, such a five-membered 
ring intermediate was not observed in the case of toluene (at either 60 °C 
or lower temperatures), possibly because of its instability caused by the 
steric hindrance of the methyl groups on the cyclopentenyl ring. 

Obviously, the reactions of the hydride cluster 1 with benzene and 
toluene involved various events, including C = C double-bond hydro- 
metallation, H, release, and C-H and C-C bond cleavage and formation. 
It is certainly due to the cooperation of the multiple Ti-H sites that all 
of these different events could be accomplished at once, although the 
mechanistic details are subject to further studies. It is also worth noting 
that the C-C bond cleavage of cyclic alkenes such as cyclopentadiene 
and cyclohexene has been successfully achieved by using multimetallic 
ruthenium hydride and carbonyl complexes”, but the breaking of a 
benzene ring did not take place under similar conditions”. The present 
sequential transformation of benzene to 2 and 5 by the hydride cluster 
1 also stands in contrast with the hydrocracking reaction of a benzene 
ring on solid catalysts, which gave a mixture of methylcyclopentane and 
acyclic saturated hydrocarbons’ *. This work thus demonstrates that 
multimetallic titanium hydrides such as 1 can serve asa unique platform 
for the activation of aromatic molecules, offering new opportunities for 
the transformation and functionalization of inactive aromatics. 


METHODS SUMMARY 


All reactions were performed under an argon atmosphere. Complex 1 was prepared 
according to literature procedures” and stored in a glovebox. 

Reaction of 1 with benzene. A benzene solution (5.0 ml) of 1 (151 mg, 0.21 mmol) 
in a 30 ml Schlenk tube was stirred at 40 °C for 36 h. Removal of the solvent under 
vacuum gave [(C;Me,SiMe;3)Ti]3(Cs;H4Me)(-H), (2) (163 mg, 98%) as a dark-green 
solid. Recrystallization from hexane at -33 °C afforded dark-green crystals (121 mg, 
0.15 mmol, 71%) suitable for X-ray diffraction. 
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Contrasting responses of mean and extreme snowfall 


to climate change 


Paul A. O’Gorman! 


Snowfall is an important element of the climate system, and one that 
is expected to change in a warming climate’*. Both mean snowfall 
and the intensity distribution of snowfall are important, with heavy 
snowfall events having particularly large economic and human 
impacts’’. Simulations with climate models indicate that annual 
mean snowfall declines with warming in most regions but increases 
in regions with very low surface temperatures**. The response of heavy 
snowfall events to a changing climate, however, is unclear. Here I show 
that in simulations with climate models under a scenario of high 
emissions of greenhouse gases, by the late twenty-first century there 
are smaller fractional changes in the intensities of daily snowfall ex- 
tremes than in mean snowfall over many Northern Hemisphere land 
regions. For example, for monthly climatological temperatures just 
below freezing and surface elevations below 1,000 metres, the 99.99th 
percentile of daily snowfall decreases by 8% in the multimodel med- 
ian, compared to a 65% reduction in mean snowfall. Both mean and 
extreme snowfall must decrease for a sufficiently large warming, but 
the climatological temperature above which snowfall extremes decrease 
with warming in the simulations is as high as —9 °C, compared to 
—14°C for mean snowfall. These results are supported by a phys- 
ically based theory that is consistent with the observed rain-snow 
transition. According to the theory, snowfall extremes occur near 
an optimal temperature that is insensitive to climate warming, and 
this results in smaller fractional changes for higher percentiles of 
daily snowfall. The simulated changes in snowfall that I find would 
influence surface snow and its hazards; these changes also suggest 
that it may be difficult to detect a regional climate-change signal in 
snowfall extremes. 

Extremes of daily precipitation (including liquid and solid precipita- 
tion) are found to increase in intensity with climate warming in obser- 
vations and simulations*”°, and this is physically consistent with greater 
saturation-specific humidities in a warmer atmosphere’. However, 
little is known about the physical basis for changes in daily snowfall ex- 
tremes, their past changes on a global or hemispheric scale, or how they 
change in global-climate-model simulations. Regional observational stud- 
ies show large interdecadal variations in measures of snowfall extremes'*””, 
but long-term trends remain unclear. Extremes of seasonal mean snow- 
fall have been studied previously'®”’, but daily snowfall extremes may 
respond differently’*. Physically, we would expect heavy snowfall events 
to occur in a relatively narrow range of temperatures below the rain- 
snow transition; at much lower temperatures it is not ‘too cold to snow’ 
but low saturation-specific humidities make heavy snowfall unlikely. How- 
ever, it is not clear what this means for the response to climate change, 
and previous studies have differed in their findings as to whether heavy 
snowfall events are predominantly associated with anomalously cold 
or warm years (or seasons) in the present climate’*”*. Variability of daily 
temperatures is another factor that must be taken into account, and cold 
extremes are expected to persist to some extent in a warming climate”. 

I show here, using simulations and a physically based theory, that 
snowfall extremes respond more weakly to climate warming than does 
mean snowfall in many regions. The simulations were performed with 
20 climate models and were taken from the World Climate Research 


Programme's Coupled Model Intercomparison Project phase 5 (CMIP5), 
which is the first phase of the project to archive daily snowfall (Methods). 
Climate change was calculated as the difference between the historical 
simulations (1981-2000; the control climate) and the representative con- 
centration pathway (RCP) 8.5 simulations (2081-2100; the warm climate). 
The snowfall variable is accumulated daily, includes all solid precip- 
itation at the surface and is expressed in liquid-water equivalent per day 
(extremes of snowfall depth are discussed in the Methods). For sim- 
plicity, only Northern Hemisphere land was considered, and results are 
presented as the multimodel median of the ratio of snowfall rates in the 
warm versus the control climate. 

Daily snowfall extremes were first measured by their 20-year return 
values, calculated by fitting the generalized extreme value distribution 
to the time series of annual maximum daily snowfall in each grid box 
(Methods). Compared to observational estimates of snowfall, the con- 
trol simulations capture the magnitudes and many of the features of mean 
and extreme snowfall, with some regional biases (Extended Data Figs 1 
and 2). Climate warming in the simulations causes widespread decreases 
in mean snowfall at middle latitudes (Fig. 1a), consistent with previous 
studies*“. In contrast, the snowfall extremes have a relatively muted 


Ratio of snowfall extremes 
= 1 


Latitude (°N) 


Longitude (°E) 


Figure 1 | Ratios of snowfall for the warm climate compared with the 
control climate. a, b, Multimodel-median ratios (colour scale) of mean 
snowfall (a) and daily snowfall extremes as measured by their 20-year return 
values (b). The 20-year return values were estimated using a fit of the 
generalized extreme value distribution to the annual-maximum time series. 
Ratios are only shown for land grid boxes where the multimodel-median of 
mean snowfall is greater than 5 cm per year in the control climate. White 
hatching denotes regions with surface elevations above 1,000 m that are not 
included in Figs 2-4. 
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response, with substantially smaller fractional changes than for mean 
snowfall in many regions (Fig. 1b). 

Snowfall statistics and their changes are expected to be strongly de- 
pendent on the climatological temperature, which varies by month and 
region. To quantify this dependence, I next analysed the changes in 
snowfall as a function of the climatological monthly surface air temper- 
ature in the control climate. Daily snowfall rates were aggregated in 5 °C 
bins with centres from -22.5 °C to 12.5 °C according to the climato- 
logical monthly surface air temperature in the control climate for each 
grid box and day. Snowfall extremes were calculated as high percen- 
tiles of the daily snowfall rates in each temperature bin, including days 
with no snowfall. Both mean snowfall and snowfall extremes in the dif- 
ferent temperature bins are in good agreement with observational esti- 
mates (Extended Data Fig. 3). The response to climate change is first 
presented for surface elevations below 1,000 m (Extended Data Fig. 4). 
Fractional decreases are greater for mean snowfall than for snowfall ex- 
tremes for much of the temperature range considered here (Fig. 2a), which 
demonstrates the contrasting responses of mean and extreme snowfall 
even when monthly variations in climatological temperature are con- 
trolled for. For the temperature bin centred around —2.5 °C, mean 
snowfall decreases by 65% in the multimodel median, whereas the 99.99th 
percentile of snowfall decreases by only 8%. Changes in snowfall extremes 
transition from positive to negative at control-climate temperatures as 
high as —9 °C, whereas the corresponding temperature for mean snow- 
fall is —14 °C. Furthermore, the difference in behaviour between mean 
and extremes is greater the higher the percentile of snowfall considered 
(Fig. 2a), and this difference is robust across different climate models 
(Extended Data Fig. 5). 

I next present a simple theory that accounts for the main features of 
the response of snowfall extremes to climate change. The theory does 
not include the response of mean snowfall, but this has been explained 
previously in terms of changes in mean precipitation and temperature”. 
Surface precipitation type depends on the vertical temperature profile 
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Figure 2 | Ratios of snowfall for the warm climate compared with the 
control climate as a function of climatological monthly surface air 
temperature in the control climate. Multimodel-median ratios of mean 
snowfall (red) in both panels. a, Multimodel-median ratios of the 99th, 99.9th 
and 99.99th percentiles of daily snowfall in increasing order from light to dark 
grey. b, Multimodel-median ratio of the 99.99th percentile of daily snowfall 
(grey line; shading shows the interquartile range), and the same ratio calculated 
from theory according to equation (1) (green dashed) and equation (2) (green 
dashed-dotted). Only land grid boxes in the Northern Hemisphere with 
surface elevation below 1,000 m are included. 
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of the lower troposphere”, but to first order it may be related to surface 
air temperature’. The daily snowfall rate s in the theory is related to 
the daily precipitation rate p by s = f(T)p, where T is the daily surface 
air temperature, and f(T) is the snowfall fraction (the fraction of pre- 
cipitation that falls as snow at a given temperature T). The f(T) diag- 
nosed from the simulations shows a sharp decline near freezing (Fig. 3), 
and this is comparable to what is found in observations (Extended Data 
Fig. 6). As expected given modest changes in lapse rates (the rates of 
decrease of temperature with height), f(T) is almost exactly the same in 
the control and warm climates (Fig. 3). 

The daily precipitation rate in the theory is assumed to have a simple 
dependence on surface air temperature according to p= ef Tp, where 
B = 0.06 °C‘ isa representative thermodynamic rate of increase of ex- 
tratropical precipitation extremes with respect to surface temperature 
related to changes in saturation-specific humidity'*. The normalized 
precipitation variable p may be thought of as a dynamic variable closely 
related to upward motion in the atmosphere; it is assumed to follow a 
gamma distribution on wet days with scale parameter y_' and shape 
parameter k. The fraction of wet days is denoted w. The temperature T 
is assumed to be normally distributed with mean T and standard devi- 
ation o, and p and T are taken to be independent. 

With these assumptions, asymptotic methods were used to evaluate 
the integrals over temperature and p involved in the calculation of high 
percentiles of snowfall (Methods). The reciprocal of the temperature de- 
pendence of the snowfall rate is denoted h(T) = pot fT), and the 
asymptotics show that the behaviour of snowfall extremes is dominated 
by the temperature T,, at which h(T) reaches a minimum (roughly —2 °C 
in the simulations and observations). T,,, is the optimal temperature for 
snowfall extremes in the theory, and it arises because of the competition 
between increasing saturation-specific humidity and decreasing snow- 
fall fraction with increasing temperature. The result is that the qth per- 
centile of snowfall s, is given by: 

(te) ee 9 
o 


which is valid asymptotically for large s, values, where I is the gamma 
function, h,, is h evaluated at T,,, and h’” is the second derivative of hat T,,,. 
For a change in mean temperature of 57’ and assuming negligible changes 
in all other parameters, the change in snowfall extremes, 884» is given by: 


or f2. of 
85 = T+ Tm 2 
ohm ( 2 ) @) 
as shown in the Methods. 
According to equation (2), 5s, transitions from positive to negative 
at a mean temperature in the control climate of T,, —5T/2 (roughly 
equal to — 6 °C in the simulations), and it is proportional to 1/(yhm), 
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Figure 3 | Daily snowfall fraction as a function of daily surface air 
temperature. The multimodel-median snowfall fraction is shown for the 
control climate (blue line; shading shows the interquartile range) and the warm 
climate (red line). It is calculated in each model and for each climate as the ratio 
of mean snowfall to mean precipitation in daily temperature bins of width 
0.25 °C. Only land grid boxes in the Northern Hemisphere with surface 
elevation below 1,000 m are included. 


28 AUGUST 2014 | VOL 512 | NATURE | 417 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


— 10-7 
? 
g 
| o- 
s 
o 
Q - 
— _194 grr - Snowfall extremes (control) 
2 = ar Snowfall extremes (warm) 
Pa — — Mean (control) 
20-4 - — — Mean (warm) 
= 
i r T iJ T T T 
-20 -15 -10 -5 0 5 10 


Climatological temperature in control climate (°C) 


Figure 4 | Multimodel-median surface air temperatures at which snowfall 
extremes occur as a function of climatological monthly surface air 
temperature in the control climate. For each control-climate temperature bin, 
surface air temperatures are averaged over grid boxes and days for which the 
daily snowfall is at or above its 99.99th percentile in the control climate 

(blue solid line; shading shows the interquartile range) and warm climate 
(red solid line). Mean temperatures are also shown (dashed lines). The blue 
dashed line deviates from a one-to-one relationship only because of sampling 
variability. Only land grid boxes in the Northern Hemisphere with 

surface elevation below 1,000 m are included. 


which is a characteristic snowfall rate at T= T,,. The change ds, also 
depends inversely on temperature variability as measured by 7, which 
makes sense given that, for example, temperature variability allows daily 
temperatures to reach below freezing even if the mean temperature 
increases to above freezing. Notably, ds, is independent of the percentile 
considered, such that the fractional change 5s,/s, is small for sufficiently 
large s,. This is the main result from the theory—that the temperature 
dependencies of precipitation extremes and the rain-snow transition 
lead to fractional changes in snowfall extremes that are small for suffi- 
ciently large snowfall extremes in the control climate. Snowfall extremes 
respond differently to climate change as compared to precipitation ex- 
tremes or mean snowfall because snowfall extremes tend to occur at 
temperatures in a relatively narrow range near the optimal temperature 
Tm in both the control and warm climates (Fig. 4). As shown schem- 
atically in Extended Data Fig. 7, changes in mean temperature do imply 
changes in the probability of occurrence of temperatures near the opti- 
mal temperature for snowfall extremes, but this only results in changes 
in snowfall extremes that are independent of the percentile considered. 

I applied the theory introduced above to the simulations (Methods; 
Extended Data Fig. 8), and it captures the important features of the re- 
sponse of the snowfall extremes to climate change as a function of cli- 
matological monthly temperature (Fig. 2b). (Application of the theory 
at individual locations is left to future work.) The simulated changes in 
snowfall extremes asymptote towards the simple theoretical form given 
by equation (2) as the percentile is increased, and good agreement with 
the theory is found for the 99.9th and 99.99th percentiles (Extended 
Data Fig. 9). 

Many mountainous regions experience heavy snowfall, but the accu- 
racy of the theory is not as good for regions with surface elevations above 
1,000 m (Extended Data Fig. 10), possibly because of variations in the 
thermodynamic response of orographic precipitation to climate change” 
or the difficulty in simulating orographic snowfall’. Nonetheless, the 
result that fractional decreases in mean snowfall are greater than those 
in snowfall extremes seems to hold regardless of elevation in the simu- 
lations (Fig. 1 and Extended Data Fig. 10). 

Changes in snowfall extremes may still have impacts, and large frac- 
tional decreases do occur in the simulations for more moderate extremes 
and for regions and times of year that are warm enough that there is 
little snowfall in the control climate (Fig. 2). In addition, changes in the 
probability of exceeding a fixed high threshold of snowfall (that is, changes 
in the frequency rather than the intensity of snowfall extremes) may still 
be substantial because of the exponential tail of precipitation distribu- 
tions (Extended Data Fig. 7b). Changes in the frequency of snowfall 
extremes cannot be directly compared with changes in mean snowfall, 
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but they may be important for impacts that involve a fixed threshold of 
snowfall. Previous work suggests that the regional climate-change sig- 
nal of mean snowfall will only emerge after that of temperature, despite 
large reductions in mean snowfall in many regions***. The relatively 
small fractional changes in snowfall extremes found here suggest that 
snowfall extremes may not be an early indicator of climate change in 
many regions; if so, this has corresponding implications for the detec- 
tion and public perception of climate change. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Simulations. The 20 climate models used were BNU-ESM, CanESM2, CMCC- 
CESM, CMCC-CM, CMCC-CMS, CSIRO-Mk3-6-0, GEDL-CM3, GEDL-ESM2G, 
GFDL-ESM2M, HadGEM2-CC, HadGEM2-ES, IPSL-CM5A-LR, IPSL-CM5A- 
MR, IPSL-CM5B-LR, MIROCS5, MIROC-ESM-CHEM, MIROC-ESM, MPI-ESM- 
LR, MPI-ESM-MR and MRI-CGCM3. The time period used for HadGEM2-ES 
for RCP 8.5 was 2081-2099 rather than 2081-2100 because only those years were 
available in the archive. The first ensemble member was used in all cases. 

For Extended Data Fig. 5, a subset of ten models was chosen in which only one 

model is included from each modelling centre: 1. BNU-ESM, 2. CanESM2, 3. CMCC- 
CM, 4. CSIRO-Mk3-6-0, 5. GFDL-CM3, 6. HadGEM2-CC, 7. IPSL-CM5A-MR, 
8. MIROCS, 9. MPI-ESM-MR, and 10. MRI-CGCM3. These models were selected 
as either the most recent or the highest resolution in each case. 
Calculation of daily snowfall extremes. I calculated snowfall extremes in two ways. 
In the first method, 20-year return values were calculated from annual maxima 
using the generalized extreme value (GEV) distribution to allow for relatively-long 
return periods at each grid box. In the second method, daily snowfall rates were 
aggregated in bins according to the climatological monthly surface air temperature 
in the control climate, and high percentiles of snowfall were estimated in each bin; 
this takes into account the sensitive dependence of snowfall on climatological monthly 
temperature and allows for a straightforward comparison with theory. 

In the first method (Fig. 1 and Extended Data Figs 1 and 2), 20-year return values 
of daily snowfall were calculated for each model or observational data set, grid box 
and climate. The 20-year return values were calculated from time series of annual 
maxima by fitting the GEV distribution using probability-weighted moments”. 
Probability-weighted moments were used rather than maximum-likelihood estima- 
tion because of the relatively short samples, and this approach has been previously 
used for precipitation extremes”* and to analyse CMIP5 output'®. The goodness of 
fit was assessed using a Monte Carlo version of the Kolmogorov—Smirnov test”®. 
(A Monte Carlo version of the test was needed because the null hypothesis involves 
parameters estimated from the time series.) Land grid boxes in the Northern Hemi- 
sphere with mean snowfall of greater than 5 cm per year in liquid-water equivalent 
were considered. The fraction of these grid boxes at which the test was passed at the 
10% significance level was found to be close to 10%; the goodness of fit declines if 
grid boxes with mean snowfall lower than 5 cm per year are included in the analysis. 
As an additional check, return values were directly estimated as empirical quantiles 
of the annual maxima time series, and similar results to the GEV estimates were 
found for a range of quantiles. For the results that are presented as maps, the snow- 
fall statistics were interpolated to a common grid before calculation of multimodel 
medians. The conclusions are similar if the snowfall extremes are instead measured 
by the 10-year or 50-year return values (not shown), although the 50-year return 
values must be viewed with caution given that the underlying time series span roughly 
20 years. 

In the second method (see Fig. 2), snowfall statistics were analysed as a function 
of climatological monthly surface air temperature in the control climate. Snowfall 
extremes were calculated as empirical quantiles of the daily snowfall rates in each 
temperature bin (without using the GEV distribution in this case). All days, includ- 
ing days with zero snowfall, were included in the analysis. The sample size of snow- 
fall rates in a given temperature bin is of the order of 10°, and the 99th, 99.9th and 
99.99th percentiles were calculated. 

Comparison of simulations with observations. The mean snowfall and snowfall 
extremes in the simulations are compared with observational estimates in Extended 
Data Figs 1, 2 and 3. Previous global-scale modelling studies have compared simulated 
snowfall rates with snowfall rates from reanalysis’ or monthly snowfall rates derived 
empirically from monthly precipitation rates and monthly surface temperatures’. 
Because observational estimates of daily snowfall are needed and because snowfall 
from reanalysis may be unreliable’, snowfall rates were estimated here on the basis 
of observed daily precipitation rates and surface air temperatures and the observed 
dependence of snowfall fraction on temperature. (Mean snowfall from CloudSat is 
also discussed below). The precipitation rates are over the period 1997-2012 and 
were taken from the one-degree daily merged product V1.2 of the Global Precip- 
itation Climatology Project (GPCP 1DD), which includes inputs from infrared, 
passive microwave, and gauge measurements”’. The precipitation rates were first 
interpolated to a coarser grid with a grid spacing of 2° that is comparable to that of 
the climate models. Conservative interpolation was used to be consistent with the 
treatment of precipitation as a flux”*. The daily surface air temperatures were taken 
from the NCEP-DOE reanalysis 2 (NCEP2)”’. The dependence of snowfall fraction 
on temperature was taken from a study of precipitation at Swedish meteorological 
stations”* (Extended Data Fig. 6) and is given by exp[—0.0000858(T + 7.5)*"?] when 
the surface air temperature T (in degrees Celsius) is between —4 °C and 7 °C. All 
snow was assumed to occur at temperatures below —4 °C and all rain at tempera- 
tures above 7 °C. The snowfall observations are for three-hourly rather than daily 
accumulations, but this is not expected to affect the results presented particularly 
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strongly. For example, the good agreement between models and observations shown 
in Extended Data Fig. 3 is retained ifa simple threshold of 1 °C is used to determine 
precipitation type for the GPCP-based observations (that is, assuming all snow 
below 1 °C and all rain above it). 

In addition, mean snowfall data from CloudSat* were used to provide a second 
and independent comparison with observations (Extended Data Figs 1 and 3). The 
CloudSat product used (2C-SNOW-PROFILE Release 4) includes vertical profiles 
of snowfall rate and surface snowfall rate based on reflectivity profiles from the 
CloudSat Cloud Profiling Radar*’. The data were available for the period mid-July 
2006 to mid-April 2011, which is sufficient to evaluate the mean snowfall rates but 
is too short to allow for estimation of snowfall extremes. 

The overall magnitude and pattern of mean and extreme snowfall are captured 
by the simulations but with some regional discrepancies (Extended Data Figs 1 and 2). 
When interpreting the model and observational maps of snowfall, it is important 
to take into account the area-averaging to a coarse grid and the use of liquid-water 
equivalent rather than snowfall depth. Snowfall biases in the models may partly 
relate to temperature biases’ and inadequate spatial resolution in regions with high 
topography’. There are also regional differences in mean snowfall between the two 
observational estimates (Extended Data Fig. 1), although these differences may relate 
in part to the different time periods used. 

The agreement between the models and the observations is very good when mean 
and extreme snowfall are analysed as a function of climatological temperature in 
the control climate (Extended Data Fig. 3). That agreement is better in this case is 
probably because mean temperature biases are less important when snowfall is 
analysed as a function of climatological temperature and because variability, cir- 
culation biases and random errors are averaged over space and time in each tem- 
perature bin. In addition, there is good agreement between the two observational 
estimates for mean snowfall, except in the lowest temperature bin (Extended Data 
Fig. 3, bottom panel). 

A comparison of the observed snowfall fraction with the snowfall fraction in the 
simulations (including all surface elevations as in the observations) suggests that 
the snowfall fraction in the multimodel median is accurate for temperatures below 
0 °C but declines to zero slightly too quickly for temperatures above 0 °C (Extended 
Data Fig. 6). The discrepancy above 0 °C could also result in part from the inexact 
nature of the comparison between station data and model grid boxes and from the 
difficulty of apportioning mixed snow and rain in observations. This discrepancy 
does not affect the optimal temperature T,,, in the theory of snowfall extremes be- 
cause T,,, <0 °C. Note that the rain-snow transition does not occur precisely at a 
surface temperature of 0 °C because frozen precipitation does not immediately 
melt as it falls past the melting level and because of temperature variability within 
the accumulation period used. 

Derivation of theory for snowfall extremes. The following assumptions are made 
in the derivation, as discussed in the main text. The daily snowfall rate s is related 
to the daily precipitation rate p and daily surface air temperature T according to 
s = f(T)p, where f(T) is the fraction of precipitation that falls as snow at a given 
temperature T. The daily surface air temperature T is assumed to be normally dis- 
tributed with mean T and standard deviation o. The precipitation rate p has a simple 
dependence on T according to p =e?" p. This exponential dependence on temper- 
ature is motivated by the thermodynamic scaling of precipitation extremes under 
climate change”’ and the observed covariability of daily precipitation extremes with 
surface temperature”. The normalized precipitation rate p is assumed to follow a 
gamma distribution on wet days”, such that its probability density function P is 
given by: 

p) = 5(p) 4 77 pk} —1p 
P(p) =(1—w)8(p) + TH? ° 

where 6 is the delta function, I" is the gamma function, w is the fraction of wet days, 
1/y is the scale parameter and k is the shape parameter. (When applying the theory 
to the simulations, wet days are defined as days with precipitation greater than 0.1 mm 
per day rather than precipitation greater than zero, as described here.) The tem- 
perature T and the normalized precipitation rate p are assumed to be independent. 

With these assumptions, the qth percentile of snowfall s, is exceeded if the fol- 
lowing inequality is satisfied: 


pel f(T) >s, 


which requires that p >h(T)s,, where h(T) =e B TT). Assuming s, is non-zero, 
the probability that s, is exceeded may be written as: 


q i" i Wi spay gg Dts 
1-—~= dT dp —~ eP eo 3 
Lg Wii De Vino (3) 


Asymptotic methods are next used to evaluate the double integral in equation (3) 
in the extreme snowfall limit of large s,. The integral in p is first evaluated using 
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a standard asymptotic expression for the incomplete gamma function”: 


oO 
| dt Het zk 1e#[1 4 0(e7})] (4) 
in the limit of large and positive z. Making the identifications 
t=yp 
z=yhs, 
in equation (4) gives: 
k-1 rs k-1 
[.__ acep) (7p) te? = (hs)! es (5) 
YASq 


which is valid asymptotically for large s,. (The tilde symbol to denote ‘is asymp- 
totic to’ is not used here, to avoid confusion with its common use to denote scaling 
behaviour.) Note that y > 0 and h(T) > 0. Substituting equation (5) into equation 
(3) gives: 


q ey AT h(T)E esa (6) 
ee ee ee ee. 4 202 
100 '(k)/2noJ_« 


For large s,, the integral in temperature is dominated by the contribution close 
to T = T,, at which h(T) reaches a minimum, which corresponds physically to snow- 
fall extremes occurring near the optimal temperature T,, (found to be roughly 
—2°C). The integral may be evaluated asymptotically using Laplace’s method”, 
and the general result used here is: 


2m 
xp(c) -1 
—,-~ a(cje 1+O(x ZL 
agen Ere] (7) 
as x —> ©, where the function ¢ reaches a maximum at t = c, and the first and 
second derivatives of # are denoted #' and ¢", respectively. Here 


| : dt g(tee) = 


X=YSq 
t=T 
c=Tn 


$(t) = —h(T) 


and ¢ reaches a maximum when h reaches a minimum. These substitutions are used 
in equation (7) to give: 


i8) PS = 
= _ (0-7? 2m yg ,. —(tm=T? 
| dT h(T)* tea Tau AK> | Valin — gaz 
—o YSq m 


which is valid asymptotically for large s,, and where the subscript ‘m’ refers to a 
quantity evaluated at T = T,,. Substituting this into equation (6) yields equation 


3 
(1) in the main text. Equation (1) can always be solved for s, ifk < e as is generally 


the case in the simulations (Extended Data Fig. 8). 

Derivation of simple expression for changes in snowfall extremes. The change 
in s, may be calculated by evaluating s, from equation (1) in each climate and 
taking the difference. Alternatively, a simple expression is derived here for the 
change in s,, assuming that all parameters other than the mean temperature T 
remain constant. The changes in s, and T between the control and warm climate 
are denoted és, and 57, respectively. Taking the ratio of the left-hand side of 
equation (1) in the warm and control climates and equating it to the same ratio 
for the right-hand side yields: 


= ASEE 
(sq +859)” ellindsy — @—zh[(T +87 Tm)’ —(F—Tn))] 


Taking the logarithm and rearranging terms gives: 


6 8T - oT k-—3 5 
“4 : (t+ T») 2 og( 1+ *) (8) 
Sq oyhmSq 2 hhmSq Sq 


Since the limit of s, + % is being taken, equation (8) implies that 5s,/s, > 0. The al- 
ternative limits 6s;—> —s, or dsq/sq—> © in which the logarithm on the right-hand 


side of equation (8) becomes large in magnitude are inconsistent with equation (8) 
3 
because k < —. Because 554/54 — 0, the second term on the right-hand side of equa- 


tion (8) may then be neglected, and equation (2) in the main text is obtained. 

According to equation (2), the change in snowfall extremes is independent of q, 

w, k and hf. If it is found that 6s, << —s, when applying equation (2), then the 
starting point given by equation (3) is invalid because it assumes s, > 0, and we 
must instead set 5s, = —s,. Note that unlike equation (2), equation (1) has the ac- 
cidental advantage of always implying non-negative snowfall rates even when the 
assumptions made in its derivation are not accurate. 
Application of the theory to the simulations. The snowfall fraction f(T) is needed 
to calculate h(T) and the optimal temperature Ty. It was calculated for each model 
and climate by binning the daily precipitation and snowfall rates in surface air tem- 
perature bins of 0.25 °C over land in the Northern Hemisphere and below or above 
1,000 m elevation as required (Fig. 3). Because the second derivative of h(T) was needed, 
the diagnosed f(T) was smoothed using a Gaussian filter with standard deviation 
0.5 °C before calculation of h(T). The multimodel medians of T,, and f(T) are 
—2.3 °C and 0.89, respectively, in both the control climate and warm climate, for 
the default case of surface elevations below 1,000 m. The functional fit to the snow- 
fall fraction from observations” discussed earlier yields similar values of T,, = —2.3 °C 
and f(Tim) = 0.93. 

The parameter describing the thermodynamic dependence of precipitation ex- 
tremes was set to 8 = 0.06 °C’, following previous work". The other parameters 
in the theory were evaluated for each control-climate temperature bin using the 
temperatures and precipitation rates aggregated within the bin. Wet days were de- 
fined to occur when precipitation is at or above 0.1 mm per day, and the gamma 
distribution was fitted to wet-day values of p using the method of moments to esti- 
mate y and k (Extended Data Fig. 8). 

The theory tends to underestimate the absolute magnitudes of the snowfall ex- 
tremes for the 99.99th percentile (Extended Data Fig. 4), although the fractional 
changes between climates are still accurate (Fig. 2b). The underestimate of the ab- 
solute magnitudes of the 99.99th percentiles results primarily from inaccuracies in 
the fit of the gamma distribution to the distribution of p. The method of moments 
was used to fit the gamma distribution because it was found to give a better fit than 
maximum-likelihood estimation for the moderate and extreme parts of the p distri- 
bution. One potential change to the theory would be to fit alternative distributions*® 
for p, although not all distributions allow for asymptotic evaluation of the integrals 
needed to calculate the snowfall extremes and thus would not lead to a simple result. 
In the section ‘Alternative form of theory using Weibull distribution’ below, I show 
that the theory may still be evaluated asymptotically when the Weibull distribu- 
tion is used instead of the gamma distribution. The conclusions are similar, with 
the primary difference being that greater deviations from an exponential tail are 
possible than with the gamma distribution, and these deviations can lead to a weak 
dependence of the changes in snowfall extremes on the percentile considered. 

The theory also assumes that p (a proxy for upward motion) and temperature 
are independent, but upward motion and precipitation are generally less likely to 
occur on anomalously cold days”, and the accuracy of the theory could be improved 
by accounting for this relationship. This refinement to the theory is not attempted 
here because of the additional complexity and assumptions needed and because the 
current form of the theory adequately captures the main features of the response of 
daily snowfall extremes to climate change. 

Alternative form of theory using Weibull distribution. The theory is also tract- 
able if the normalized precipitation rate p is assumed to follow a Weibull distribu- 
tion on wet days instead ofa gamma distribution. The probability density function 
P for p is then given by: 

P(p) = (1—w)8(p) + wla( ap)! 12 
where 6 is the delta function, w is the fraction of wet days, 1/a is the scale para- 
meter, and / is the shape parameter. In calculating the qth percentile of snowfall, 


the integral in p is exact, and the integral in T is performed using Laplace’s method 
as before. The result is: 


hl eee 
8g hm 2 e(asyhin) — __W a 
(s,hm) a=) Vay 


The simple expression for the change in s,, corresponding to equation (2) when the 
gamma distribution is used, is given by: 


> ua (t+ = T>) (9) 


The parameters in the Weibull distribution were estimated using maximum- 
likelihood estimation, and the results for the changes in snowfall extremes were 
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found to be similar to the results from the theory using the gamma distribution 
(not shown). According to equation (9), the change in snowfall extremes ds, depends 
on (s,)! ~1 and therefore is no longer completely independent of the percentile to the 
extent that / differs from 1. However, this dependence was found to be weak, and 
typical values of /in the simulations are in the range 0.7-1.1. Importantly, it is still the 
case that the fractional change s,/s, is small for sufficiently large s, because / > 0. 
Role of circulation changes and robustness of results. In the theory, y and k are 
the parameters that are most strongly tied to dynamics and updraft strength. These 
parameters do change to some extent as the climate warms (Extended Data Fig. 8), 
but they do not change sufficiently to alter the large contrast between the changes in 
mean and extreme snowfall, and similar results are found whether snowfall extremes 
are estimated from the full theory given by equation (1) or if the simple estimate 
given by equation (2) is used that assumes parameters such as y and k are fixed 
(Fig. 2b). The ratios from equation (2) are calculated as 1 + dsq/sz, where all param- 
eters other than the temperature change are evaluated from the control climate. 

Much of the uncertainty in changes in upward velocities in climate-model simu- 
lations is thought to relate to parameterized moist convection***’,which is more im- 
portant for warm-season or tropical precipitation, even if convection may enhance 
snowfall locally in a given storm. Consistent with this interpretation, extratropical 
precipitation extremes are generally found to respond to climate change in a robust 
manner, unlike tropical precipitation extremes'*”. Inaccuracy in simulating Arctic 
sea-ice loss could affect the warming pattern and circulation, but this would not be 
expected to alter the contrast between the responses of mean and extreme daily 
snowfall substantially, and similar results are found here for the subset of models 
that have previously been identified*® as performing well when simulating Arctic 
sea ice (not shown). 

Extended Data Fig. 5 illustrates the robustness of the greater declines in mean 
snowfall as compared to snowfall extremes. To increase the extent to which the 
models are independent, a subset of ten models with only one model from each 
climate centre was analysed (see the ‘Simulations’ section for the list of models). 
Extended Data Fig. 5a shows that there are widespread regions in which snowfall 
extremes (as measured by the 20-year return period) fractionally decrease by less 
than mean snowfall (or increase) in all ten of the models considered. Extended 
Data Fig. 5b shows that for each of the models separately the fractional decrease 
in snowfall extremes is robustly less than that in mean snowfall for the —2.5 °C 
control-climate temperature bin. 

Heuristic argument for changes in snowfall extremes. The simple estimate given 
by equation (2) may also be obtained using a heuristic argument based on the pro- 
perty that snowfall extremes tend to occur at temperatures close to T;,, in both the 
control and warm climates (Fig. 4). Consider the case, illustrated in Extended Data 
Fig. 7, in which the mean temperature is above T,,, in the control climate. The joint 
probability density function (PDF) of temperature T and normalized precipitation 
p is the product of a Gaussian distribution in temperature and a gamma distri- 
bution in p. An increase in mean temperature reduces the joint PDF in the pre- 
ferred temperature range for extreme snowfall near T,,, (Extended Data Fig. 7a), 
with the result that high percentiles of p and snowfall must also decrease (Extended 
Data Fig. 7b). The integral of the joint PDF over p>s,hm at T= T,, must remain 
approximately the same in each climate because the percentile considered is un- 
changed. At T = T,,,, the joint PDF has an exponential dependence on — (Tm — Ty i 
(207) —yp, and considering only the exponential part for simplicity, we find that 


—6 [(Tm —TY /(20?)} — y8sqhm =0. In the limit of a small change in mean tem- 
perature, we find that 6s, =8T (Tm — T) /(o’yhm), which is consistent with equa- 
tion (2). So the increase in mean temperature reduces the snowfall extremes in this 
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case, but by an amount that is independent of the percentile considered, such that 
the change is a small fraction of the snowfall extreme in the control climate for 
sufficiently high percentiles. 

Snowfall depth versus liquid-water equivalent. Snowfall is expressed in liquid-water 
equivalent in the simulations, but snowfall depth is often measured in observations". 
Snowfall depth depends on snow density in addition to the liquid-water equivalent, 
and snow density depends on temperature as well as other factors. The theory of 
snowfall extremes described above may be easily modified to apply to snowfall depth 
by assuming a functional dependence of snow density on temperature and including 
this dependence in the expression relating snowfall and precipitation rates. The 
snowfall extremes measured in snowfall depth would then be associated with a 
lower optimal temperature T,, than those measured in liquid-water equivalent (for 
example, using equations (1) and (2) of ref. 42 for the density of snow together with 
the observed snowfall fraction curve” yields T,, = —4.3 °C), but the basic features 
of the contrast between the responses of mean snowfall and snowfall extremes 
remain the same. 
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Extended Data Figure 1 | Mean snowfall in simulations and observations. from GPCP/NCEP2 (b) and CloudSat (c). In each case, results are only 
a, The control climate in the multimodel median. b, c, Observational estimates | shown where mean snowfall exceeds 5 cm per year. 
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Extended Data Figure 2 | Daily snowfall extremes in simulations and are the 20-year return values estimated using a fit of the generalized extreme 
observations. a, The control climate in the multimodel median. value distribution to the annual-maximum time series. In each case, results 


b, Observational estimate from GPCP/NCEP2. The snowfall extremes shown _ are only shown where mean snowfall exceeds 5 cm per year. 
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Extended Data Figure 3 | Mean and extreme snowfall as a function of interquartile range) and as estimated from GPCP/NCEP2 (black dashed line). 
climatological monthly surface air temperature in simulations and CloudSat mean snowfall (red dashed-dotted line) is also shown in d. For the 
observations. a—d, The 99.99th (a), 99.9th (b), and 99th (c) percentiles of daily | observational curves, NCEP2 monthly temperatures were used to define the 
snowfall and mean snowfall (d) are shown for the control climate in the climatological monthly surface air temperature bins. Only land grid boxes in 
multimodel median (black solid line with circles; shading shows the the Northern Hemisphere (but all surface elevations) are included. 
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Extended Data Figure 4 | Mean and extreme snowfall in different climates __ shift left with warming (to some extent) because of the important influence of 
as a function of climatological monthly surface air temperature. a-d, The — temperature on snowfall. Also shown are theoretical estimates given by 
multimodel-median 99.99th (a), 99.9th (b), and 99th (c) percentiles of daily equation (1) for high percentiles of snowfall in the control climate (blue dashed 
snowfall and mean snowfall (d) are shown in the control climate (blue line line) and the warm climate (red dashed line). Only land grid boxes in the 
with circles) and warm climate (red line with circles). The snowfall statistics Northern Hemisphere with surface elevation below 1,000 m are included. 
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Extended Data Figure 6 | Snowfall fraction as a function of surface air 
temperature in simulations and observations. The snowfall fraction is shown 
for the control climate in individual models (grey lines) and the multimodel 
median (solid black line). A functional fit to observations is shown for 
comparison (black dashed line). The snowfall fraction for models is calculated 
as the ratio of mean snowfall to mean precipitation in daily temperature bins 
of width 0.25 °C, as in Fig. 3 but with all surface elevations included. The 
functional fit to the observed snowfall fraction is for three-hourly observations 
from Swedish meteorological stations”. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a A < Snow Rain > 
w 
a 
oa 
< 
Ke] 
2 
F 0°C T 
bo4 
Control 
Warm 
w 
Qa 
oa 
3 \ 
2 
IN 
) - ‘I 
|| 
0 Sq Sq D/ him 


Extended Data Figure 7 | Schematic illustrating the effect of climate 
warming on the joint PDF of temperature T and normalized precipitation 
rate Ds and the resulting change in a high snowfall percentile sg. a, b, The 
joint PDF as a function of T at a fixed p (a), and as a function of snowfall 
rate p/h, at T= T,, close to which snowfall extremes tend to occur (b). The 
joint PDF is shown for the control (blue) and warm (red) climates. Mean 
snowfall and the probability of snowfall can be inferred to decrease markedly 
with warming from a, while in b the area under the joint PDF to the right of s, is 
the same in each climate, and Sq experiences a relatively small fractional 
decrease with warming. 
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Extended Data Figure 8 | Parameters in the theory as a function of 
climatological monthly surface air in the control climate. a, b, Shown are the 
multimodel-medians of the rate parameter y (a) and shape parameter k (b) in 
the control climate (blue line; shading shows the interquartile range) and 
warm climate (red line). Only land grid boxes in the Northern Hemisphere with 
surface elevation below 1,000 m are included. 
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Extended Data Figure 9 | Multimodel-median changes in snowfall extremes 
between the control and warm climates as a function of climatological 
monthly surface air temperature in the control climate. a-c, 99.99th 

(a), 99.9th (b) and 99th (c) percentiles of daily snowfall for the simulations 
(black line with circles), theory estimate from equation (1) (green dashed line), 


and simple theory estimate from equation (2) (green dashed-dotted line). 
The simple theory estimate is not independent of percentile for high 
climatological temperatures because it is constrained to not imply a negative 
snowfall rate in the warm climate. Only land grid boxes in the Northern 
Hemisphere with surface elevation below 1,000 m are included. 
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Extended Data Figure 10 | Ratios of snowfall for land grid boxes in the 
Northern Hemisphere with elevations at or above 1,000 m. a, b, Ratios are 
shown for the warm climate compared with the control climate as a function of 
climatological monthly surface air temperature in the control climate. 
Multimodel-median ratios of mean snowfall (red line) are shown in both 
panels. a, Multimodel-median ratios of the 99th, 99.9th and 99.99th percentiles 
of daily snowfall in increasing order from light to dark grey. b, Multimodel- 
median ratio of the 99.99th percentile of daily snowfall (grey line; shading 
shows the interquartile range), and the same ratio according to the theory 
estimate from equation (1) (green dashed line) and the simple theory estimate 
from equation (2) (green dashed-dotted line). 
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A primitive fish from the Cambrian of North America 


Simon Conway Morris! & Jean-Bernard Caron? 


Knowledge of the early evolution of fish largely depends on soft- 
bodied material from the Lower (Series 2) Cambrian period of South 
China’. Owing to the rarity of some of these forms and a general 
lack of comparative material from other deposits, interpretations of 
various features remain controversial“, as do their wider relation- 
ships amongst post-Cambrian early un-skeletonized jawless verte- 
brates. Here we redescribe Metaspriggina’ on the basis of new material 
from the Burgess Shale and exceptionally preserved material collected 
near Marble Canyon, British Columbia’, and three other Cambrian 
Burgess Shale-type deposits from Laurentia. This primitive fish dis- 
plays unambiguous vertebrate features: a notochord, a pair of prom- 
inent camera-type eyes, paired nasal sacs, possible cranium and arcualia, 
W-shaped myomeres, and a post-anal tail. A striking feature is the 
branchial area with an array of bipartite bars. Apart from the anterior- 
most bar, which appears to be slightly thicker, each is associated with 
externally located gills, possibly housed in pouches. Phylogenetic 
analysis places Metasprigginaas a basal vertebrate, apparently close 
to the Chengjiang taxa Haikouichthys'* and Myllokunmingia', dem- 
onstrating also that this primitive group of fish was cosmopolitan 
during Lower-Middle Cambrian times (Series 2-3). However, the 
arrangement of the branchial region in Metaspriggina has wider 
implications for reconstructing the morphology of the primitive ver- 
tebrate. Each bipartite bar is identified as being respectively equi- 
valent to an epibranchial and ceratobranchial. This configuration 
suggests that a bipartite arrangement is primitive and reinforces 
the view that the branchial basket of lampreys’ is probably derived. 
Other features of Metaspriggina, including the external position of 
the gills and possible absence of a gill opposite the more robust anterior- 
most bar, are characteristic of gnathostomes® and so may be prim- 
itive within vertebrates. 

Cambrian chordates’~ are effectively restricted to the Burgess Shale*? 
and Chengjiang’~* Lagerstatten, and despite soft-part preservation key 
structures may be difficult to resolve*””°. In addition, differential decay 
of various parts of the body may skew interpretation of character states’ 
and potentially compromise phylogenetic analysis. 

Here we redescribe the poorly known Burgess Shale chordate Meta- 
spriggina walcotti’ (Extended Data Fig. 1), on the basis of approximately 
100 new specimens collected in the Canadian Rockies from several 
Cambrian Burgess Shale-type deposits. Localities include the Burgess 
Shale (Walcott Quarry) in Yoho National Park (Fig. 1g and Extended 
Data Fig. 2), and three sites in Kootenay National Park: Haiduk Peak 
(Extended Data Fig. 3)'* and, most importantly, near Marble Canyon® 
(Fig. la-f, h-m and Extended Data Figs 4, 5). In addition, Emmonsaspis 
(partim) from the Parker Slate Formation’ in Vermont, hitherto ten- 
tatively identified as a frond-like fossil'*, and the Kinzers Formation in 
Pennsylvania (R. Thomas, personal communication), are reinterpreted 
as Metaspriggina spp. (Extended Data Fig. 6). Despite variations in age 
(Extended Data Fig. 7 and Supplementary Table 1) and palaeoenviron- 
mental settings between these occurrences (and by implication tapho- 
nomic histories) the similarities of anatomy allow reliable reconstructions. 

Descriptions are largely based on the more complete ‘Marble Canyon’® 
fossils (see also Fig. 2). Like other Burgess Shale-type material, fossils are 
preserved as carbonaceous compressions and aluminosilicate minerals 


(Extended Data Fig. 4f). Incompleteness precludes a precise estimate of 
size range, but the most complete specimens (Fig. 1a, b) are about 60 mm 
in length and 8-13 mm in height. Laterally the body is fusiform, widest 
near the middle, tapering to a fine point posteriorly (Fig. 1a, b and Ex- 
tended Data Fig. 4a), whereas in dorsal view the anterior termination is 
rounded (Fig. 1d and Extended Data Fig. 4c—e). The animal was com- 
pressed laterally, as is evident from occasional folding of the body as well 
as specimens in dorso-ventral orientation being conspicuously narrower 
(Fig. la and Extended Data Fig. 5a). Along the anterior ventral margin 
there was a keel-like structure (Fig. 1b, g, i, k, 1), but no fins have been 
recognized. In the much more abundant specimens of Haikouichthys'°* 
fins are seldom obvious, suggesting that their absence in Metaspriggina 
might be taphonomic. 

The diagnostic myomeres are invariably present, but are sometimes 
highly disorganized especially towards the anterior (Fig. 1b, d, i and 
Extended Data Fig. 1g), suggesting variable levels of decay". Undisrupted 
specimens show open chevrons with the main apices directly anteriorly 
(Fig. 1h, kand Extended Data Fig. 5c). Dorsally, however, some specimens 
show a smaller additional fold with the blunt apex directed posteriorly 
(Fig. 1k), while in one specimen (Fig. 1h) another posterior inflection 
lies towards the ventral side. Overall, therefore, the myomeres have a 
W-shaped configuration. As the body narrows posteriorly the prin- 
cipal myotomal apices become much more acute (approximately 60°), 
and the subsidiary dorsal inflection is probably lost (Extended Data 
Figs 1g, i, 4a). The myomeres, totalling at least 40, are considerably more 
acute than in Pikaia® and, in contrast to this chordate, Metaspriggina 
was evidently an effective swimmer’. Parker Slate specimens differ in 
being less slender and having myomeres with a more angular closure 
(Extended Data Fig. 6a—d). In dorso-ventral specimens an elongate 
strand (0.25 mm thick) is identified as the notochord (Fig. la-d and 
Extended Data Fig. 1a—d, f). In laterally orientated material the notochord 
lies on the midline, opposite the zone of myotomal closure (Fig. 1h). Occa- 
sional narrower strands in the anterior region probably represent parts 
of the vascular system (Fig. 1d). 

The head is small and slightly bilobed, with smooth margins and 
possibly a central notch (Fig. 1d and Extended Data Fig. 4c-e). It bears 
two prominent eyes (Fig. la-d, f, g, i, j, , m and Extended Data Figs 3, 
4b-e, 5a—d, 6d). These appear to have been originally circular (maximum 
diameter approximately 1.3 mm), but elliptical shapes (Fig. 1d) may re- 
flect slightly oblique burial. Typically the eye is preserved as a reflective 
film, but a well-defined circular area (Fig. 1f) (approximately 0.4 mm) 
appears to be the lens, suggesting a camera-like arrangement. In several 
specimens (Fig. Ic, i, j, 1, m and Extended Data Figs 1c, d, 5b), paired 
circular areas located between the eyes are interpreted as the nasal sacs. 
The proximity of these structures suggests a single median duct might 
have connected them. The paired eyes sometimes lie at a steep angle to 
the body axis (Extended Data Fig. 5b), suggesting that the head formed 
a discrete lobe capable of rotation. A median triangular area may repre- 
sent cranial cartilage (Fig. 1d and Extended Data Fig. 4c—e), while darker 
dorsal elements might indicate possible arcualia (Fig. 1g). 

Posterior to the head and ventrally positioned is a large branchial area. 
This region is identified on the basis of two sets of bars, seven in total, on 
each side of the body (Fig. 1d and Extended Data Fig. 4c). In dorso-ventral 
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Figure 1 | Metaspriggina walcotti (Simonetta and Insom, 1993) specimens 
from Marble Canyon and Walcott’s Quarry collected by the Royal 
Ontario Museum. a-f, h-m, Specimens collected from Marble Canyon. 

g, Specimens collected from Walcott’s Quarry. a-~m, Complete (a-c) and partial 
(d-m) specimens, anterior to the left (except i). a, ROM62935, oblique view. 
b, c, ROM62938, lateral view and close up of the anterior section (framed area 
in c). d, e, ROM62933, oblique view, and close up of gill bars (framed area 

in e). Only the position of some gill bars (Brd1 and Brv1 in e) are highlighted by 
arrows (see also Extended Data Fig. 4c). f, ROM62946, oblique view, pair of 
eyes showing presumed lens (see Extended Data Fig. 5d). g, ROM62964, lateral 
view. h, ROM62924, lateral view, showing W-shaped myomeres and main 
inflections (thin arrows). i, j, ROM62932, lateral view, and close up showing 
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eyes (framed area in j with the image flipped vertically). k, ROM62954, lateral 
view showing internal organs. 1, m, ROM62928, oblique view, and close up 
showing eyes in m. a, b, e, h, j-m, Backscatter scanning electron microscopy 
images (e, j), composite images of both parts and counterparts (f-i, k-m) and 
stitched images at white lines (a, b). An, anus; Ar?, possible arcualia; Bc?, 
possible extra-branchial cartilage; Brv, branchial bars (ventral element); Brd, 
branchial bars (dorsal element); Brp, branchial bar processes; Bv, blood vessel; 
Cc?, possible cranial cartilage; Es, oesophagus; Ey, eyes; Gi, gill filaments; 

Gu, gut; He?, possible heart; Ke, keel; Le, lens; Li, liver; My, myomere; Na, nasal 
sacs; No, notochord; Not, notch. Scale bars: 5 mm (a, b, g-i, k, 1); 2 mm (d); 
1mm (c, e, f, j, m). 
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Figure 2 | Metaspriggina walcotti (Simonetta and Insom, 1993). 
Diagrammatic anatomical drawing, dorsal and lateral views showing main 
morphological features (gills and possible blood vessels; arcualia and extra 
branchial and cranial cartilages have not been reconstructed). Mo?, possible 
position of mouth; Ph, pharyngeal area. For other abbreviations, see Fig. 1. 


view both sets of bars are curved (Fig. 1d), while in lateral aspect the 
upper series is obliquely inclined ventrally and the lower series is more 
steeply angled (Fig. 1b, g). Bars may show a series of short extensions 
(Fig. 1d, e and Extended Data Fig. 4c). In addition, in one specimen 
(Fig. 1d, e) the anterior-most dorsal bar appears to be somewhat more 
robust (0.32 mm in thickness) than the other bars (average = 0.25 mm, 
standard deviation = 0.028). Closely spaced and relatively massive un- 
equivocal gill filaments seem located distal to the associated bars (Fig. 1d, e), 
except perhaps along the anterior-most set of bars. No gill openings have 
been identified, but gills were possibly housed in pouches reminiscent of 
the arrangement in the related Myllokunmingia’ and Haikouichthys’. 
Faint ventro-lateral structures parallel to the body (Fig. 1d) may rep- 
resent extra-branchial cartilage. 

From the roof of the posterior part of the branchial chamber a nar- 
row strand, sometimes with positive relief, is interpreted as the oesoph- 
agus (Fig. 1g, kand Extended Data Fig. 5d, e) and this extends posteriorly 
towards the rest of the gut. An anterior area surrounding the gut trace 
possibly represents the heart (Fig. 1a, b, kand Extended Data Fig. 2e, 6b), 
followed by a darker area that may be a liver (Fig. 1a, b, g, h, k, land Ex- 
tended Data Figs 1a, g, 2b-e, 3b, c, 5a, c, e). Gut content is evident in the 
posterior section and just before the anus (Fig. la and Extended Data 
Fig. le, h). The latter is located on the posterior margin, thus defining 
a post-anal tail. Gut content is indeterminate, but Metaspriggina may 
have been microphagous with the short extensions on the gill bars pos- 
sibly serving an analogous function to gill rakers. 

Phylogenetic analysis (Fig. 3 and Supplementary Information) indi- 
cates Metaspriggina to bea stem vertebrate, possibly close to the slightly 
older Chengjiang taxa Haikouichthys and Myllokunmingia and other- 
wise basal to all other agnathans and gnathostomes. Myllokunmingia is 
known only from one specimen’, but comparisons to Haikouichthys'* 
show similarities (for example, paired eyes, nasal sacs, lobate head), but 
also important differences. Although the myomeral configuration is 
also similar to Haikouichthys'”, in Metaspriggina the additional ventral 
chevron (Fig. 1h) and clear dorsal bend define a W-shaped arrangement 
directly comparable to fish'’. The branchial region in Metaspriggina is 
also more voluminous, potentially consistent with greater respiratory 
demands. The branchial bars have a markedly different orientation 
from Haikouichthys, in which not only are they less well-preserved, but 
in contrast to Metaspriggina the presumed ceratobranchials are par- 
allel to the ventral margin and the upper series of epibranchials lie at a 
steep angle”. 

The configuration of the branchial bars and associated gills in Meta- 
spriggina has three wider implications. The first is that this arrangement 
recalls their frequently invoked ancestral configuration'”-'’, even though 
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this is usually regarded as hypothetical’’. Primitive serial homology is 
often used as a starting point in discussions on the origin of jaws, although 
Metaspriggina is too basal (Fig. 3) to be directly informative in this regard”°. 
The second implication concerns possible links to the branchial config- 
uration in extinct and extant lampreys. An intermediary stage between 
Metaspriggina and the complex branchial basket ofliving lamprey’ may 
occur in Euphanerops”' and the possibly synonymous Endeiolepis”. 
Although their unusual polybranchy may reflect hypoxia”’, in these 
Late Devonian period (Frasnian) agnathans the branchial arches have 
fused into single sinuous units and are now associated with an array of 
small rods and ‘copular elements’. This may prefigure*’ the complex 
basket of extant lampreys’, a configuration that had apparently stabi- 
lized by the latest Devonian period (Famennian)”’. 

Branchial arches of Metaspriggina also show two features reminiscent 
of gnathostomes. Although considered homologous*”*, the position of 
the gills relative to the branchial arches differ, being interior in agnathans 
but exterior in the gnathostomes and apparently also in Metaspriggina. 
While differential preservation cannot be ruled out, the anterior-most 
branchial arch appears more robust and lacks gills. Correspondingly, 
in gnathostomes the mandibular arch is larger and clearly differen- 
tiated and only from the second branchial arch (hyoid) are gills borne’. 
Accordingly, if the anterior-most arch in Metaspriggina was to be equated 
with the mandibular arch then this might suggest that this region was 
always gill-less. In addition the putative extrabranchial cartilages of this 
Cambrian fish find a possible parallel in elasmobranchs*”, although 
convergence is perhaps more likely. Finally, both Metaspriggina and 
Haikouichthys are consistent with the earliest vertebrates possessing 
bipartite arches. Apart from a postulated fusion in the lamprey clade, it 
is suggested that unless homoplasious, this bipartite arrangement may 
have persisted crown-ward. If so, then models of jaw articulation such 
as ‘hinges and caps”*, invaluable in developmental studies, might need 
to be treated with caution”* if employed in the context of atavistic home- 
otic transformations”. 


METHODS SUMMARY 


Most specimens from Marble Canyon were collected in situ (Supplementary Table 1). 
Specimens were prepared mechanically to remove sediment concealing important 
morphological features. Lack of sediment fissility and difficulties of preparation 
often make exposure of the fossil (especially at either end) difficult. Specimens were 
studied using a range of photographic techniques commonly employed for this type 
of material’. Pictures of both parts and counterparts were merged in Adobe Photo- 
shopCS6 using the ‘apply image’ function and ‘darken’ blending mode (Fig. 1a, b, 
f-i, k-m and Extended Data Figs 1-3, 5a, c, d). Scanning electron microscopy images 
were obtained using a JEOL JSM6610-Lv at the University of Toronto (Earth Sciences). 
Methods regarding phylogenetic analysis can be found in Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 21 February; accepted 29 April 2014. 
Published online 11 June 2014. 


1. Shu, D.-G. et a/. Lower Cambrian vertebrates from South China. Nature 402, 42-46 
(1999). 

2. Shu, D.-G. et a/. Head and backbone of the Early Cambrian vertebrate 
Haikouichthys. Nature 421, 526-529 (2003). 

3. Hou, X.-G., Aldridge, R. J., Siveter, D. J., Siveter, D. J. & Feng, X.-H. New evidence on 
the anatomy and phylogeny of the earliest vertebrates. Proc. R. Soc. Lond. B 269, 
1865-1869 (2002). 

4. Zhang, X.-G.& Hou, X.-G. Evidence for a single median fin-fold and tail in the Lower 
Cambrian vertebrate, Haikouichthys ercaicunensis. J. Evol. Biol. 17, 1162-1166 
(2004). 

5. Conway Morris, S.A redescription of a rare chordate, Metaspriggina walcotti 
Simonetta and Insom, from the Burgess Shale (Middle Cambrian), British 
Columbia, Canada. J. Paleontol. 82, 424-430 (2008). 


422 | NATURE | VOL 512 | 28 AUGUST 2014 


6. Caron, J.-B. et al. Anew phyllopod bed-like assemblage from the Burgess Shale of 
the Canadian Rockies. Nature Commun. 5, 3210 (2014). 

7. Martin, W.M., Bumm, L. A. & McCauley, D. W. Development of the viscerocranial 
skeleton during embryogenesis of the sea lamprey, Petromyzon marinus. Dev. Dyn. 
238, 3126-3138 (2009). 

8. Janvier, P. Early Vertebrates (Oxford Monographs on Geology and Geophysics) Vol. 33 
(Clarendon, 1996). 

9. Conway Morris, S. & Caron, J.-B. Pikaia gracilens Walcott, a stem-group chordate 
from the Middle Cambrian of British Columbia. Biol. Rev. Camb. Philos. Soc. 87, 
480-512 (2012). 

10. Mallatt, J. & Holland, N. Pikaia gracilens Walcott: stem chordate, or already 
specialized in the Cambrian? J. Exp. Zool. B Mol. Dev. Evol. 320, 247-271 (2013). 

11. Sansom, R. S., Gabbott, S. E. & Purnell, M.A. Atlas of vertebrate decay: a visual and 
taphonomic guide to fossil interpretation. Palaeontology 56, 457-474 (2013). 

12. Johnston, K. J., Johnston, P. A. & Powell, W. G. A new Middle Cambrian, Burgess 
Shale-type biota, Bolaspidella Zone, Chancellor Basin, southeastern British 
Columbia. Palaeogeogr. Palaeoclimatol. Palaeoecol. 277, 106-126 (2009). 

13. Webster, M. Systematic revision of the Cambrian trilobite Bathynotus Hall, 1860, 
with documentation of new occurrences in western Laurentia and implications for 
intercontinental biostratigraphic correlation. Mem. Assoc. Australasian 
Palaeontologists 37, 369-406 (2009). 

14. Conway Morris, S. Ediacaran-like fossils in Cambrian Burgess Shale-type faunas of 
North America. Palaeontology 36, 593-635 (1993). 

15. Lacalli, T. The Middle Cambrian fossil Pikaia and the evolution of chordate 

swimming. EvoDevo 3, 12 (2012). 

16. Van Leeuwen, J. L.A mechanical analysis of myomere shape in fish. J. Exp. Biol. 

202, 3405-3414 (1999). 

17. Gillis, J. A., Modrell, M.S. & Baker, C. V. H. Developmental evidence for serial 
homology of the vertebrate jaw and gill arch skeleton. Nature Commun. 4, 1436 
(2013). 

8. Kuratani, S. Evolution of the vertebrate jaw from developmental perspectives. Evol. 
Dev. 14, 76-92 (2012). 

9. Kuratani, S. Developmental studies of the lamprey and hierarchical evolutionary 
steps towards the acquisition of the jaw. J. Anat. 207, 489-499 (2005). 

20. Gai, Z.-K., Donoghue, P. C. J., Zhu, M., Janvier, P. & Stampanoni, M. Fossil jawless 
fish from China foreshadows early jawed vertebrate anatomy. Nature 476, 
324-327 (2011). 

21. Janvier, P. & Arsenault, M. The anatomy of Euphanerops longaevus Woodward, 
1900, an anaspid-like jawless vertebrate from the Upper Devonian of Miguasha, 
Quebec, Canada. Geodiversitas 29, 143-216 (2007). 

22. Janvier, P., Desbiens, S., Willett, J. A. & Arsenault, M. Lamprey-like gills in a 
gnathostome-related Devonian jawless vertebrate. Nature 440, 1183-1185 
(2006). 

23. Gess, R. W., Coates, M. |. & Rubidge, B.S. A lamprey from the Devonian period of 
South Africa. Nature 443, 981-984 (2006). 

24. Mallatt, J. Early vertebrate evolution: pharyngeal structure and the origin of 
gnathostomes. J. Zool, 204, 169-183 (1984). 

25. Depew, M. J. & Compagnucci, C. Tweaking the hinge and caps: testing a model of 
the organization of jaws. J. Exp. Zool. B Mol. Dev. Evol. 310, 315-335 (2008). 

26. Medeiros, D. M. & Crump, J. G. New perspectives on pharyngeal dorsoventral 
patterning in development and evolution of the vertebrate jaw. Dev. Biol. 371, 
121-135 (2012). 

27. Nichols, J. T., Pan, L-Y., Moens, C. B. & Kimmel, C. B. barx1 represses joints and 
promotes cartilage in the craniofacial skeleton. Development 140, 2765-2775 
(2013). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank T. Lacalli for comments on an earlier draft of the 
manuscript, J. Mallatt for an extensive series of critiques, and J. Hoyal-Cuthill and C. Aria 
for assistance with phylogenetic analyses. We also thank M. Collins for technical 
drawings and reconstructions, and R. Thomas and M. Webster for information on the 
Kinzers and Parker formations, respectively. S.C.M. thanks V. Brown for manuscript 
preparation, and the Department of Earth Sciences and St John’s College, Cambridge 
for support. We thank Parks Canada for granting a collection and research permit to 
J.-B.C. (YNP-2012-12054). Fieldwork support for the 2012 expedition comes from the 
Royal Ontario Museum (DMV Research and Acquisition Fund and DNH Fieldwork 
Fund), M. Streng (Uppsala University and the Swedish Research Council), R. Gaines 
(Pomona College), G. Mangano (University of Saskatchewan) and a Natural Sciences 
and Engineering Research Council Discovery Grant (to J.-B.C., #341944). This is Royal 
Ontario Museum Burgess Shale project number 53. 


Author Contributions J.-B.C. collected fossils, prepared all illustrative material and 
conducted phylogenetic analyses. S.C.M. wrote early drafts of paper, and both authors 
discussed results and developed observations and conclusions. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to S.C.M. (sc113@cam.ac.uk) or 
J.-B.C. (jcaron@rom.on.ca). 


©2014 Macmillan Publishers Limited. All rights reserved 


Mate Ae dL Teas 


doi:10.1038/nature13665 


Neural constraints on learning 


Patrick T. Sadtler?*, Kristin M. Quick! , Matthew D. Golub?*, Steven M. Chase””, Stephen I. Ryu’, 
Elizabeth C. Tyler-Kabara’*°, Byron M. Yu2*°* & Aaron P. Batistal?3* 


Learning, whether motor, sensory or cognitive, requires networks of 
neurons to generate new activity patterns. As some behaviours are 
easier to learn than others'”, we asked if some neural activity patterns 
are easier to generate than others. Here we investigate whether an 
existing network constrains the patterns that a subset of its neurons 
is capable of exhibiting, and if so, what principles define this con- 
straint. We employed a closed-loop intracortical brain-computer inter- 
face learning paradigm in which Rhesus macaques (Macaca mulatta) 
controlled a computer cursor by modulating neural activity patterns 
in the primary motor cortex. Using the brain-computer interface par- 
adigm, we could specify and alter how neural activity mapped to cursor 
velocity. At the start of each session, we observed the characteristic 
activity patterns of the recorded neural population. The activity of a 
neural population can be represented in a high-dimensional space 
(termed the neural space), wherein each dimension corresponds to 
the activity of one neuron. These characteristic activity patterns com- 
prise a low-dimensional subspace (termed the intrinsic manifold) 
within the neural space. The intrinsic manifold presumably reflects 
constraints imposed by the underlying neural circuitry. Here we show 
that the animals could readily learn to proficiently control the cursor 
using neural activity patterns that were within the intrinsic manifold. 
However, animals were less able to learn to proficiently control the 
cursor using activity patterns that were outside of the intrinsic mani- 
fold. These results suggest that the existing structure of a network can 
shape learning. On a timescale of hours, it seems to be difficult to 
learn to generate neural activity patterns that are not consistent with 
the existing network structure. These findings offer a network-level 
explanation for the observation that we are more readily able to learn 
new skills when they are related to the skills that we already possess**. 

Some behaviours are easier to learn than others’*. We hypothesized 
that the ease or difficulty with which an animal can learn a new behav- 
iour is determined by the current properties of the networks of neurons 
governing the behaviour. We tested this hypothesis in the context of 
brain-computer interface (BCT) learning. In a BCI paradigm, the user 
controls a cursor on a computer screen by generating activity patterns 
across a population of neurons. A BCI offers advantages for studying 
learning because we can observe all of the neurons that directly control 
an action, and we can fully specify the mapping from neural activity to 
action. This allowed us to define which activity patterns would lead to 
task success and to test whether subjects were capable of generating 
them. Previous studies have shown that BCI learning can be remarkably 
extensive*"®, raising the intriguing possibility that most (or all) novel 
BCI mappings are learnable. 

Two male Rhesus macaques (aged 7 and 8 years) were trained to move 
a cursor from the centre of screen to one of eight radially arranged tar- 
gets by modulating the activity of 85-91 neural units (that is, threshold 
crossings on each electrode) recorded in the primary motor cortex (Fig. la). 
To represent the activity of the neural population, we defined a high- 
dimensional space (called the neural space) where each axis corresponds 


to the activity of one neural unit. The activity of all neural units during 
a short time period is represented as a point in this space (Fig. 1b). At 
each time step, the neural activity (a green point in Fig. 1b) is mapped 
onto a control space (black line in Fig. 1b; two-dimensional plane in the 
actual experiments, corresponding to horizontal and vertical cursor veloc- 
ity) to specify cursor velocity. The control space is the geometrical rep- 
resentation of a BCI mapping. At the start of each day, we calibrated an 
‘intuitive mapping’ by specifying a control space that the monkey used 
to move the cursor proficiently (Extended Data Fig. 1). 

At the beginning of each day we also characterized how each neural 
unit changed its activity relative to the other neural units (that is, how 
the neural units co-modulated). In the simplified network represented 
in Fig. 1b, neurons 1 and 3 positively co-modulate due to common input, 
whereas neurons 1 and 2 negatively co-modulate due to an indirect inhib- 
itory connection. Such co-modulations among neurons mean that neural 
activity does not uniformly populate the neural space’*"’*. We iden- 
tified the low-dimensional space that captured the natural patterns of 
co-modulation among the recorded neurons. We refer to this space as 
the intrinsic manifold (yellow plane in Fig. 1b, c). By construction, the 
intuitive mapping lies within the intrinsic manifold. Our key experimental 
manipulation was to change the BCI mapping so that the control space 
was either within or outside of the intrinsic manifold. A within-manifold 
perturbation was created by re-orienting the intuitive control space 
but keeping it within the intrinsic manifold (depicted as the red line in 
Fig. 1c). This preserved the relationship between neural units and co- 
modulation patterns, but it altered the way in which co-modulation 
patterns affected cursor kinematics (red arrows, Fig. la). An outside- 
manifold perturbation was created by re-orienting the intuitive control 
space and allowing it to depart from the intrinsic manifold (depicted as 
the blue line in Fig. 1c). This altered the way in which neural units con- 
tributed to co-modulation patterns, but it preserved the way in which co- 
modulation patterns affected cursor kinematics (blue arrows, Fig. 1a). 
In both cases, performance was impaired once the new mapping was 
introduced, and we observed whether the monkeys could learn to re- 
gain proficient control of the cursor. 

To regain proficient control of the cursor under a within-manifold per- 
turbation, the animals had to learn new associations between the natural 
co-modulation patterns and the cursor kinematics (Fig. 1d). To restore 
proficient control of the cursor under an outside-manifold perturbation, 
the animals had to learn to generate new co-modulation patterns among 
the recorded neurons. Our hypothesis predicted that within-manifold 
perturbations would be more readily learnable than outside-manifold 
perturbations. 

Just after the perturbed mappings were introduced, BCI performance 
was impaired (Fig. 2a, b, first grey vertical band). Performance improved 
for the within-manifold perturbation (Fig. 2a), showing that the ani- 
mal learned to control the cursor under that mapping. In contrast, per- 
formance remained impaired for the outside-manifold perturbation 
(Fig. 2b), showing that learning did not occur. We quantified the amount 
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Figure 1 | Using a brain-computer interface to study learning. a, Monkeys 
moved the BCI cursor (blue circle) to acquire targets (green circle) by 
modulating their neural activity. The BCI mapping consisted of first mapping 
the population neural activity to the intrinsic manifold using factor analysis, 
then from the intrinsic manifold to cursor kinematics using a Kalman filter. 
This two-step procedure allowed us to perform outside-manifold perturbations 
(blue arrows) and within-manifold perturbations (red arrows). D, dimensions. 
b, A simplified, conceptual illustration using three electrodes. The firing rate 
(FR) observed on each electrode in a brief epoch define a point (green dots) 
in the neural space. The intrinsic manifold (yellow plane) characterizes the 
prominent patterns of co-modulation. Neural activity maps onto the control 
space (black line) to specify cursor velocity. c, Control spaces for an intuitive 
mapping (black arrow), within-manifold perturbation (red arrow) and outside- 
manifold perturbation (blue arrow). d, Neural activity (green dot) elicits 
different cursor velocities (open circles and inset) under different mappings. 
Arrow colours as in ¢. 


of learning as the extent to which BCI performance recovered from its 
initial impairment to the level attained while using the intuitive map- 
ping (Fig. 2c). For within-manifold perturbations, the animals regained 
proficient control of the cursor (red histograms in Fig. 2d and Extended 
Data Fig. 2), indicating that they could learn new associations between 
natural co-modulation patterns and cursor kinematics. For outside- 
manifold perturbations, BCI performance remained impaired (blue his- 
tograms in Fig. 2d and Extended Data Fig. 2), indicating that it was 
difficult to learn to generate new co-modulation patterns, even when 
those patterns would have led to improved performance in the task. 
These results support our hypothesis that the structure of a network 
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Figure 2 | Better learning for within-manifold perturbations than 
outside-manifold perturbations. a, b, Task performance during one 
representative within-manifold perturbation session (a) and one representative 
outside-manifold perturbation session (b). Black trace, success rate; green 
trace, target acquisition time. Dashed vertical lines indicate when the BCI 
mapping changed. Grey vertical bands represent 50-trial bins used to determine 
initial (red and blue dots) and best (red and blue asterisks) performance with 
the perturbed mapping. c, Quantifying the amount of learning. Black dot, 
performance with the intuitive mappings; red and blue dots, performance 
(success rate and acquisition time are relative to performance with intuitive 
mapping) just after the perturbation was introduced for sessions in Fig. 2a and 
Fig. 2b; red and blue asterisks, best performance during those perturbation 
sessions; dashed line, maximum learning vector for the session in Fig. 2a. The 
amount of learning for each session is the length of the raw learning vector 
projected onto the maximum learning vector, normalized by the length of the 
maximum learning vector. This is the ratio of the length of the thin red line 
to the length of the dashed line. d, Amount of learning for all sessions. A value of 
1 indicates complete learning of the relationship between neural activity and 
kinematics, and 0 indicates no learning. Learning is significantly better for 
within-manifold perturbations (red, n = 28 (monkey J), 14 (monkey L)) than 
for outside-manifold perturbations (blue, n = 39 (monkey J), 15 (monkey L)). 
Arrows indicate the sessions shown in Fig. 2a (red) and Fig. 2b (blue). Dashed 
lines, means of distributions; solid lines, mean + standard error of the mean 
(s.e.m.). Pvalues were obtained from two-tailed Student’s t-tests. 


determines which patterns of neural activity (and corresponding behav- 
iours) a subject can readily learn to generate. 

Two additional lines of evidence show that BCI control was more learn- 
able when using within-manifold perturbations than outside-manifold 
perturbations. First, perturbation types differed in their after-effects. After 
a lengthy exposure to the perturbed mapping, we again presented the 
intuitive mapping (the second dashed vertical line in Fig. 2a, b). Follow- 
ing within-manifold perturbations, performance was impaired briefly 
(Extended Data Fig. 3, red histogram), indicating that learning had 
occurred". Following outside-manifold perturbations, performance was 
not impaired, which is consistent with little, ifany, learning having occurred 
(Extended Data Fig. 3, blue histogram). Second, the difference in learn- 
ability between the two types of perturbation was present from the earliest 
sessions, and over the course of the study the monkeys did not improve 
at learning (Extended Data Fig. 4). 

These results show that the intrinsic manifold was a reliable predictor 
of the learnability ofa BCI mapping: new BCI mappings that were within 
the intrinsic manifold were more learnable than those outside of it. We 
considered five alternative explanations for the difference in learnabil- 
ity. First, we considered the possibility that mappings which were more 
difficult to use initially might be more difficult to learn. We ensured that 
the initial performance impairments were equivalent for the two pertur- 
bation types (Fig. 3a). 
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Figure 3 | Alternative explanations do not explain the difference in 
learnability between the two types of perturbation. a, Performance 
impairment immediately following within-manifold and outside-manifold 
perturbations. b, Mean principal angles between intuitive and perturbed 
mappings. c, Mean required change in preferred direction (PD) for individual 
neural units. For all panels: red, within-manifold perturbations; blue, outside- 
manifold perturbations; dashed lines, means of distributions; solid lines, 
mean + s.e.m.; P values are for two-tailed Student’s t-tests; same number of 
sessions as in Fig. 2d. 


Second, we posited that the animals must search through neural space 
for the new control space following the perturbation. Ifthe control spaces 
for one type of perturbation tended to be farther from the intuitive con- 
trol space, then they might be harder to find, and thus, learning would 
be reduced. We ensured that the angles between the intuitive and per- 
turbed control spaces did not differ between the two perturbation types 
(Fig. 3b). Incidentally, Fig. 3b also shows that the perturbations were not 
pure workspace rotations. If that were the case, the angles between con- 
trol spaces would have been zero, not in the range of 40-80° as shown. 

Third, we considered how much of an impact the perturbations exerted 
on the activity of each neural unit. Learning is manifested (at least in part) 
as changes in the preferred direction (that is, the direction of movement 
for which a neuron is most active) of individual neurons”"*. If learning 
one type of perturbation required larger changes in preferred directions 
of neural units, then those perturbations might be harder to learn. We 
predicted the changes in preferred directions that would be required to 
learn each perturbation while minimizing changes in activity. We ensured 
that learning the two perturbation types required comparable preferred- 
direction changes (Fig. 3c). 

Fourth, for one monkey (L), we ensured that the sizes of the search 
spaces for finding a strategy to proficiently control the cursor were the 
same for both perturbation types (see Methods). 

Fifth, hand movements were comparable and nearly non-existent 
for both perturbation types and should therefore have had no impact 
on learnability (Extended Data Fig. 5). 

We conclude from these analyses that the parsimonious explanation 
for BCI learning is whether or not the new control space is within the 
intrinsic manifold. These alternative explanations did reveal interesting 
secondary aspects of the data; they partially explained within-category 
differences in learnability, albeit in an idiosyncratic manner between 
the two monkeys (Extended Data Fig. 6). 

A key step in these experiments was the identification of an intrinsic 
manifold using dimensionality reduction". Although our estimate of 
the intrinsic manifold can depend on several methodological factors 
(Extended Data Fig. 7 caption), the critical property of such a manifold 
is that it captures the prominent patterns of co-modulation among the 
recorded neurons, which presumably reflect underlying network con- 
straints. For consistency, we estimated a linear, ten-dimensional intrinsic 
manifold each day. In retrospect, we considered whether our choice of 
ten dimensions had been appropriate (Fig. 4). We estimated the intrinsic 
dimensionality of the neural activity for each day (Fig. 4a); the average 
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Figure 4 | Properties of the intrinsic manifold. a, Cross-validated log- 
likelihoods (LL; arbitrary units) of the population activity for different days. 
The peaks (open circles) indicate the estimated intrinsic dimensionality (EID). 
Vertical bars indicate the standard error of log-likelihoods, computed across 
four cross-validation folds. A ten-dimensional intrinsic manifold was used for 
all experiments (solid circles). b, EID across all days and both monkeys 
(mean = s.e.m., 9.81 + 0.31). c, Difference between the LL for the ten- 
dimensional (10D) model and the EID model. Units are the number of 
standard errors of LL for the EID model. For 89% (78 of 88) of the days, the LL 
for the ten-dimensional model was within one standard error of the EID model. 
All sessions were less than two standard errors away. d, Cumulative shared 
variance explained by the ten-dimensional intrinsic manifold used during the 
experiment. Coloured curves correspond to the experimental days shown in 
Fig. 4a. The black curve shows the mean = s.e.m. across all days (n = 88; 
monkey J, 58; monkey L, 30). 


dimensionality was about ten (Fig. 4b). Even though the estimated dimen- 
sionalities ranged from 4-16, the selection of ten dimensions still provided 
a model that was nearly as good as the best model (Fig. 4c). Because the 
top few dimensions captured the majority of the co-modulation among 
the neural units (Fig. 4d), we probably could have selected a different 
dimensionality within the range of near-optimal dimensionalities and 
still attained similar results (Extended Data Fig. 7 caption). We note that 
we cannot make claims about the ‘true’ dimensionality of the primary 
motor cortex, in part because it probably depends on considerations such 
as the behaviours the animal is performing and, perhaps, its level of skill. 

Sensorimotor learning probably encompasses a variety of neural mech- 
anisms, operating at diverse timescales and levels of organization. We 
posit that learning a within-manifold perturbation harnesses the fast- 
timescale learning mechanisms that underlie adaptation”, whereas learn- 
ing an outside-manifold perturbation engages the neural mechanisms 
required for skill learning’. This suggests that learning outside-manifold 
perturbations could benefit from multi-day use*”’. Such learning might 
require the intrinsic manifold to expand or change orientation. 

Other studies have employed dimensionality-reduction techniques to 
interpret how networks of neurons encode information’*'° and change 
their activity during learning”***. Our findings strengthen those discov- 
eries by showing that low-dimensional projections of neural data are not 
only visualization tools—they can reveal causal constraints on the activ- 
ity attainable by networks of neurons. Our study also indicates that the 
low-dimensional patterns present among a population of neurons may 
better reflect the elemental units of volitional control than do individ- 
ual neurons. 

In summary, a BCI paradigm enabled us to reveal neural constraints 
on learning. The principles we observed may govern other forms of learn- 
ing*’*8 and perhaps even cognitive processes. For example, combin- 
atorial creativity’, which involves re-combining cognitive elements in 


28 AUGUST 2014 | VOL 512 | NATURE | 425 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


new ways, might involve the generation of new neural activity patterns 
that are within the intrinsic manifold of relevant brain areas. Trans- 
formational creativity, which involves creating new cognitive elements, 
may result from generating neural activity patterns outside of the rel- 
evant intrinsic manifold. More broadly, our results help to provide a 
neural explanation for the balance we possess between adaptability and 
persistence in our actions and thoughts”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Electrophysiology and behavioural monitoring. We recorded from the proximal 
arm region of the primary motor cortex in two male Rhesus macaques (Macaca 
mulatta, aged 7 and 8 years) using 96-channel microelectrode arrays (Blackrock 
Microsystems) as the monkeys sat head-fixed in a primate chair. All animal handling 
procedures were approved by the University of Pittsburgh Institutional Animal Care 
and Use Committee. At the beginning of each session, we estimated the root-mean- 
square voltage of the signal on each electrode while the monkeys sat calmly in a 
darkened room. We then set the spike threshold at 3.0 times the root-mean-square 
value for each channel. Spike counts used for BCI control were determined from 
the times at which the voltage crossed this threshold. We refer to the threshold cross- 
ings recorded on one electrode as one neural unit. We used 85-91 neural units each 
day. We did not use an electrode if the threshold crossing waveforms did not resem- 
ble action potentials or if the electrode was electrically shorted to another electrode. 
The data were recorded approximately 19-24 months after array implantation for 
monkey J and approximately 8-9 months after array implantation for monkey L. 

We monitored hand movements using an LED marker (PhaseSpace Inc.) on the 
hand contralateral to the recording array. The monkeys’ arms were loosely restrained. 
The monkeys could have moved their forearms by approximately 5 cm from their 
arm rests, and there were no restrictions on wrist movement. The hand movements 
during the BCI trials were minimal, and we observed that the monkeys’ movements 
did not approach the limits of the restraints. Extended Data Fig. 5a shows the aver- 
age hand speed during the BCI trials. For comparison, Extended Data Fig. 5b shows 
the average hand speed during a standard point-to-point reaching task. We also 
recorded the monkeys’ gaze direction (SR Research Ltd). Those data are not ana- 
lysed here. 

Task flow. Each day began with a calibration block during which we determined 
the parameters of the intuitive mapping. The monkeys then used the intuitive map- 
ping for 400 trials (monkey J) or 250 trials (monkey L) during the baseline block. 
We then switched to the perturbed mapping for 600 trials (monkey J) or 400 trials 
(monkey L) for the perturbation block. This was followed by a 200-trial washout 
block with the intuitive mapping. Together, the perturbation and washout blocks 
comprised a perturbation session. The transitions between blocks were made seam- 
lessly, without an additional delay between trials. We gave the monkey no indica- 
tion which type of perturbation would be presented. On most days, we completed 
one perturbation session (monkey J, 50 of 58 days; monkey L, 29 of 30 days). On 
nine days, we completed multiple perturbation sessions. 

Experimental sessions. We conducted 78 (30 within-manifold perturbations; 48 
outside-manifold perturbations) sessions with monkey J. We conducted 31 sessions 
(16 within-manifold perturbations; 15 outside-manifold perturbations) with mon- 
key L. For both monkeys, we did not analyse a session if the monkey attempted 
fewer than 100 trials with the perturbed mapping. For monkey J, we did not analyse 
11 sessions (2 within-manifold perturbations; 9 outside-manifold perturbations). 
For monkey L, we did not analyse 3 sessions (2 within-manifold perturbations; 1 
outside-manifold perturbation). 

BCI calibration procedures. Each day began with a calibration block of trials. The 
data that we recorded during these blocks were used to estimate the intrinsic man- 
ifold and to calibrate the parameters of the intuitive mappings. For monkey J, we 
used two calibration methods (only one ona given day), and for monkey L, we used 
one method for all days. 

The following describes the BCI calibration procedures for monkey J. The first 
method for this monkey relied on the neural signals being fairly stable across days. 
At the beginning of each day, the monkey was typically able to control the cursor 
proficiently using the previous day’s intuitive mapping. We collected data for cali- 
bration by having the monkey use the previous day’s intuitive mapping for 80 trials 
(10 per target). 

We designed the second method because we were concerned about the potential 
for carry-over effects across days. This method relied on passive observation of cursor 
movement”. The monkey observed the cursor automatically complete the centre-out 
task for 80 trials (10 per target). At the beginning of each trial, the cursor appeared 
in the centre of the monkey’s workspace for 300 ms. Then, the cursor moved at a 
constant velocity (0.15ms_') to the pseudo-randomly chosen target for each trial. 
When the cursor reached the target, the monkey received a juice reward. After each 
trial, there was a blank screen for 200 ms before the next trial. 

For both methods for monkey J, we used the neural activity recorded 300 ms 
after the start of each trial until the cursor reached the peripheral target for BCI 
calibration. 

The following describes the BCI calibration procedure for monkey L. We observed 
that neural activity for this monkey was not as stable from day to day as it was for 
monkey J. As a result, we could not use the calibration procedure relying on the 
previous day’s intuitive mapping. Additionally, the observation-based calibration 
procedure was not as effective at generating an intuitive decoder for monkey Las it 
had been for monkey J. Therefore, we used a a closed-loop calibration procedure 
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(similar to reference 32) to generate the intuitive decoder. The procedure began 
with 16 trials (2 to each target) of the observation task. We calibrated a decoder from 
these 16 trials in the same manner as the first method for monkey J. We then switched 
to the BCI centre-out task, and the monkey controlled the velocity of the cursor 
using the decoder calibrated on the 16 observation trials. We restricted movement 
of the cursor so that it moved ina straight line towards the target (that is, any cursor 
movement perpendicular to the straight path to the target was scaled by a factor of 0). 
After 8 trials (1 to each target), we calibrated another decoder from those 8 trials. 
The monkey then controlled the cursor for 8 more trials with this newly calibrated 
decoder with perpendicular movements scaled by a factor of 0.125. We then cali- 
brated a new decoder using all 16 closed-loop trials. We repeated this procedure 
over a total of 80 trials until the monkey was in full control of the cursor (perpen- 
dicular velocity scale factor = 1). We calibrated the intuitive mapping using the 80 
trials during which the monkey had full or partial control of the cursor. For each of 
those trials, we used the neural activity recorded 300 ms after the start of the trial 
until the cursor reached the peripheral target. 

BCI centre-out task. The same closed-loop BCI control task was used during the 
baseline, perturbation and washout blocks. At the beginning of each trial, the cursor 
(circle, radius = 18 mm) appeared in the centre of the workspace. One of eight possible 
peripheral targets (chosen pseudo-randomly) was presented (circle, radius = 20 mm; 
150 mm (monkey J) or 125 mm (monkey L) from centre of workspace, separated by 
45°). A 300 ms freeze period ensued, during which the cursor did not move. After 
the freeze period, the velocity of the cursor was controlled by the monkey through 
the BCI mapping. The monkey had 7,500 ms to move the cursor into the peripheral 
target. If the cursor acquired the peripheral target within the time limit, the mon- 
key received a juice reward. After 200 ms, the next trial began. With the intuitive 
mappings, the monkeys’ movement times were near 1,000 ms (Extended Data Fig. 1), 
but the monkeys sometimes exceeded the 7,500 ms acquisition time limit with the 
perturbed mappings. If the cursor did not acquire the target within the time limit, 
there was a 1,500 ms time-out before the start of the next trial. 

Estimation of the intrinsic manifold. We identified the intrinsic manifold from 
the population activity recorded during the calibration session using the dimen- 
sionality reduction technique factor analysis****. The central idea is to describe the 
high-dimensional population activity u in terms of a low-dimensional set of fac- 
tors z. Each factor is distributed according to the standard normal distribution N. 
This can be written in vector form as: 


z~N(0,1) (1) 
where J is the identity matrix. The neural activity is related to those factors by: 
ulz~N(Az+ ph) (2) 


where uelR?*? is a vector of z-scored spike counts (z-scoring was performed sep- 
arately for each neural unit) taken in non-overlapping 45 ms bins across the q neural 
units, and zeR!°*! contains the ten factors. That is, the neural activity u given a set 
of factors z is distributed according to a normal distribution with mean Az + p and 
diagonal covariance . The intrinsic manifold is defined as the column space of A. 
Each factor, or latent dimension, is represented by a column of A. We estimated A, 
wand w using the expectation-maximization algorithm”*’. The data collected dur- 
ing the calibration sessions had 1,470 + 325 (monkey J, mean + standard deviation) 
and 1,379 + 157 (monkey L) samples. 

Intuitive mappings. The intuitive mapping was a modified version of the standard 
Kalman filter*®. A key component of the experimental design was to use the Kalman 
filter to relate factors (z) to cursor kinematics rather than to relate neural activity 
directly to the cursor kinematics. This modification allowed us to perform the two 
different types of perturbation. We observed that performance with our modified 
Kalman filter is qualitatively similar to performance with a standard Kalman filter 
(data not shown). 

The first step in the construction of the intuitive mapping was to estimate the 
factors using factor analysis (equations (1) and (2)). For each z-scored spike count 
vector u;, we computed the posterior mean of the factors a= E[z;|u]. We then 
z-scored each factor (that is, each element of 2) separately. 

The second step was to estimate the horizontal and vertical velocity of the cursor 
from the z-scored factors using a Kalman filter: 


X1|xr-1 ~ N(Axr_1 +b, Q) (3) 


2, |x-~N(Cx; + d,R) (4) 


where x,¢R?*! is a vector of horizontal and vertical cursor velocity at time step t. 
We fitted the parameters A, b, Q, C, d and R using maximum likelihood by relating 
the factors to an estimate of the monkeys’ intended velocity during the calibration 
sessions. At each time point, this intended velocity vector either pointed straight 
from the current cursor position to the target with a speed equal to the current cursor 
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speed” (monkey J, first calibration task) or pointed straight from the centre of the 
workspace to the target with a constant speed (0.15ms_', monkey Land monkey J, 
second calibration task). 

Because spike counts were z-scored before factor analysis, 4. = 0. Because factors 
were z-scored before decoding into cursor velocity, d = 0. Because calibration kine- 
matics were centred about the centre of the workspace, b=0. 

The decoded velocity that was used to move the cursor at time step t was R= 
E|x,| 2),....2,|.Wecan express R, in terms of the decoded velocity at the previous 
time step Re and the current z-scored spike count vector u;: 


R,=M Ry 1+Mou, (5) 
M,=A—KCA (6) 
M,=K2.f (7) 
B=A™(AAT +0) (8) 


As part of the procedure for z-scoring factors, &, is a diagonal matrix where the 
(p, p) element is the inverse of the standard deviation of the pth factor. K is the steady- 
state Kalman gain matrix. We z-scored the spike counts and the factors in the intu- 
itive mappings so that the perturbed mappings (which were based on the intuitive 
mappings) would not require a neural unit to fire outside of its observed spike count 
range. 

Perturbed mappings. The perturbed mappings were modified versions of the intu- 
itive mapping. Within-manifold perturbations altered the relationship between fac- 
tors and cursor kinematics. The elements of the vector 2; were permuted before being 
passed into the Kalman filter (red arrows, Fig. 1b). This preserves the relationship 
between neural units and the intrinsic manifold, but changes the relationship between 
dimensions of the intrinsic manifold and cursor velocity. Geometrically, this cor- 
responds to re-orienting the control space within the intrinsic manifold. 

The following equations describe within-manifold perturbations: 


A A 
X= M\X;-1 + Mow (9) 


Mow =Kiwy=zP (10) 


where Mwy, is a 10 x 10 permutation matrix defining the within-manifold per- 
turbation (that is, the within-manifold perturbation matrix). Each element ofa per- 
mutation matrix is either 0 or 1. In each column and in each row of a permutation 
matrix, one element is 1, and the other elements are 0. In other words, nwyy2z Puy 
is a permuted version of Xz fu;. 

Outside-manifold perturbations altered the relationship between neural units 
and factors. The elements of 4, were permuted before being passed into the factor- 
analysis model (blue arrows, Fig. 1b). This preserves the relationship between fac- 
tors and cursor velocity, but changes the relationship between neural units and factors. 
Geometrically, this corresponds to re-orienting the control space within the neural 
space and outside of the intrinsic manifold. 

The following equations describe outside-manifold perturbations: 


A A 
Xx; = My x;-1 +M2.0mm 


(11) 
(12) 


where Noy is a q X q permutation matrix defining the outside-manifold perturba- 
tion (that is, the outside-manifold perturbation matrix). In other words, noyu isa 
permuted version of u;. 

Choosing a perturbed mapping. We used data from the first 200 trials (monkey J) 
or 150 trials (monkey L) of closed-loop control during the baseline blocks to deter- 
mine the perturbation matrix that we would use for the session. The procedure we 
used had three steps (detailed below). First, we defined a set of candidate perturba- 
tions. Second, we predicted the open-loop cursor velocities for each candidate per- 
turbation. Third, we selected one candidate perturbation. We aimed to choose a 
perturbation such that the perturbed mapping would not be too difficult for the 
monkeys to use nor so easy that no learning was needed to achieve proficient 
performance. 

For monkey J, we often alternated perturbation types across consecutive days. 
For monkey L, we determined which type of perturbation we would use each day 
before the first experiment. That order was set randomly by a computer. We did this 
in order to avoid a detectable pattern of perturbation types. 

The following describes the first step in choosing a perturbed mapping: defining 
the candidate perturbations. For within-manifold perturbations, nw, is a 10 x 10 
permutation matrix. The total number possible yw, is 10 factorial (3,628,800). We 
considered all of these candidate within-manifold perturbations. 


M2,om=K2izhnom 


For outside-manifold perturbations, yoy, is a q x q permutation matrix, where q 
is the number of neural units. For a population of 90 neural units, there are 90 fac- 
torial (>10'”°) possible values of yo;. Due to computational constraints, we were 
unable to consider every possible 79,4 as. a candidate perturbation. We used slightly 
different procedures to determine the candidate outside-manifold perturbations 
for the two monkeys. 

The procedure we used for monkey J is as follows. We permuted the neural units 
independently. We chose to permute only the neural units with the largest modu- 
lation depths (mean number of units permuted, 39 + 18). Permuting the units with 
larger modulation depths impacted the monkey’s ability to proficiently control the 
cursor more than would permuting units with smaller modulation depths. For each 
session, we randomly chose 6 million 75), that permuted only the specified units. 
This formed the set of candidate outside-manifold perturbations. 

The procedure we used for monkey L is as follows. To motivate it, note that the 
two perturbation types altered the intuitive mapping control space within a different 
number of dimensions of the neural space for monkey J. Within-manifold pertur- 
bations were confined to ten dimensions of the neural space, but outside-manifold 
perturbations were confined to N dimensions of the neural space (where N is the 
number of permuted units, 39 on average). Thus, the dimensionality of the space 
through which the monkey would have to search to find the perturbed control space 
was different for the two types of perturbed mappings; it was larger for the outside- 
manifold perturbations than it was for the within-manifold perturbations. We 
recognized that this difference may have affected the monkey’s ability to learn outside- 
manifold perturbations. For monkey L, we reduced the size of the search space for 
the outside-manifold perturbations, thereby equalizing the size of the search space 
for the two perturbation types. We did this by constraining 1.9, so that the number 
of possible 79,, was equal to the number of candidate within-manifold perturba- 
tions. We then considered all yo, to be candidate outside-manifold perturbations. 
To construct outside-manifold perturbations, we assigned each neural unit to one 
of eleven groups. The first ten groups had an equal number of neural units. The 
eleventh group had the remaining neural units. We specifically put the neural units 
with the lowest modulation depths in the eleventh group. The 10m (where m is the 
number of neural units per group) neural units with the highest modulation depths 
were randomily assigned to the first ten groups. We created outside-manifold per- 
turbations by permuting the first ten groups, keeping all the neural units within a 
group together. Thus, the number of possible 179,, is 10 factorial, all of which were 
considered as candidate outside-manifold perturbations. 

We attempted to keep these groupings as constant as possible across days. On 
some days, one electrode would become unusable (relative to the previous day) as 
evident from the threshold crossing waveforms. When this occurred, we kept all of 
the groupings fixed that did not involve that electrode. If an electrode in one of the 
first ten groups became unusable, we would substitute it with a neural unit from the 
eleventh group. 

The following describes the second step in choosing a perturbed mapping: esti- 
mating the open-loop velocities of each candidate perturbation. The open-loop velocity 
can be thought of as a coarse approximation to how the cursor would move if the 
monkey did not learn. The open-loop velocity measurement captures how the neural 
activity updates the velocity of the cursor from the previous time step, whereas the 
closed-loop decoder (equation (5)) also includes contributions from the decoded 
velocity at the previous time step (MX;_ ) as well as from the neural activity at the 
current time step (M2 u;). To compute the open-loop velocity, we first computed 
the average z-scored spike counts of every neural unit in the first 200 (monkey J) or 
150 (monkey L) trials of the baseline block. We binned the spike counts from 300 ms 
to 1,300 ms (monkey J) or 1,100 ms (monkey L) after the beginning of each trial, 
and then averaged the spike counts for all trials to the same target. Together, these 
comprised 8 spike count vectors (one per target). For each of the spike count vectors, 
we computed the open-loop velocity for the candidate perturbations: 


— i 
Xo. =M2p ug 


(13) 
where ui, is the mean z-scored spike count vector for the ih target. Mo p is Mo wm for 
within-manifold perturbations and M2 om for outside-manifold perturbations. 
The following describes the third step in choosing a perturbation: selecting a 
candidate perturbation. For each candidate perturbation, we compared the open- 
loop velocities under the perturbed mapping to the open-loop velocities under the 
intuitive mapping on a per-target basis. We needed the velocities to be dissimilar 
(to induce learning) but not so different that the animal could not control the cursor. 
For each target, we measured the angles between the 2D open-loop velocity vectors. 
Wealso measured the magnitude of the open-loop velocity for the perturbed map- 
ping. For each session, we defined a range of angles (average minimum of range across 
sessions: mean + s.e.m, 19.7° + 7.0°; average maximum of range across sessions: 
44.4° + 8.9°) anda range of velocity magnitudes (average minimum of range across 
sessions, 0.7 mm s }+0.4mm st; average maximum of range across sessions, 
5.5mms_ '+4,0mms_'). Note that when the monkey controlled the cursor in 
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closed-loop (equation (5)), the cursor speeds were much greater than these ranges 
of open-loop velocities. This is because M, was nearly an identity matrix for our 
experiments. Thus, the term M Re 1 is expected to be larger than the term M2 u;. 
We found all candidate perturbations for which the angles and magnitudes for all 
targets were within the designated ranges. From the candidate perturbations that 
remained after applying these criteria, we arbitrarily chose one to use as the per- 
turbation for that session. 

Amount of learning. This section corresponds to Fig. 2c. For each session, we com- 
puted the amount of learning during perturbation blocks as a single, scalar value 
that incorporated both changes in success rate (percent of trials for which the periph- 
eral target was acquired successfully) and target acquisition time. We sought to use 
a metric that captured how much the monkeys’ performance improved through- 
out the perturbation block relative to how much it was impaired at the beginning of 
the perturbation block. Having a single value for each session allowed us to more 
easily compare learning across sessions and to relate the amount of learning to a 
variety of properties of each perturbation (Extended Data Fig. 6). We also analysed 
each performance criterion individually for each monkey without any normaliza- 
tion (Extended Data Fig. 2). We saw consistent differences in learnability. Thus, our 
results do not rely on the precise form of our learning metric, but the form we used 
provides a scalar value as a convenient summary metric. 

As success rate and target acquisition time are expressed in different units, we 
first normalized each metric. We found the mean and standard deviation of the suc- 
cess rates and target acquisition times across all non-overlapping 50-trial bins in 
the baseline, perturbation and washout blocks for each monkey. We then z-scored 
the success rates and target acquisition times separately for each monkey. Figure 2c 
shows normalized performance projected onto veridical units. 

For each session, we computed the average z-scored success rate and the average 
z-scored target acquisition time across all bins in the baseline block. 


s 
n-(2 
4B 


where Px is the performance, sp is the average normalized success rate and ag is the 
average normalized acquisition time during the baseline block (monkey J, 386.9 + 82.5 
trials; monkey L, 292.1 + 43.5 trials). 

Wealso computed the normalized success rates and acquisition times for all bins 
in the perturbation blocks. 


(14) 
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where Pp(j) is the performance, sp(j) is the normalized success rate, and ap(j) is 
the average normalized acquisition time during the jth 50-trial bin of the perturba- 
tion block. 

Empirically, we observed that the monkeys’ performance during the perturba- 
tion blocks did not exceed the performance during the baseline blocks. Therefore, 
we define a maximum learning vector (Lmax) as a vector that extends from the per- 
formance in the first bin with the perturbed mapping to the point corresponding to 
baseline performance (Fig. 2c). 


Fac Py et) (16) 


The length of this vector is the initial performance impairment because it describes 
the drop in performance that resulted when we switched from the baseline block to 
the perturbation block (shown in Fig. 3a and Extended Data Fig. 6a). For each bin 
(j) within the perturbation blocks, we defined a raw learning vector (Lyaw(j)). This 
vector extended from the point corresponding to initial performance during the 
perturbation block to the point corresponding to performance during each bin. 


Lraw(j) = Pe(j) — Pp(1) (17) 


We projected the raw learning vectors onto the maximum learning vector. These 
were termed the projected learning vectors (Lpro(j)). 


Lsex Linas 
L roj j)= Liaw j)* 
eed) ( 0) Ty) (i) 


The lengths of the projected learning vectors relative to the lengths of the max- 
imum learning vectors define the amount of learning in each 50-trial bin (Lpin(j)). 


Loin(J) = [|Lproj(3) I 


19 
|| Lax | ae 
An amount of learning of 0 indicates that the monkey did not improve perform- 
ance, and a value of 1 indicates that the monkey fully improved (up to the level 
during the baseline block). For each session, we computed the amount of learning 
for all bins, and we selected the largest one as the amount of learning for that session. 


(18) 
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Lesion = max; (Loin (j)) (20) 
Figure 2c shows the raw learning vectors for one bin in each of two sessions (thick 
blue and red lines), along with the projected learning vector (thin red line) and the 
maximum learning vector (dashed grey line) for one of those sessions. 

Principal angles between intuitive and perturbed control spaces. This section 
corresponds to Fig. 3b and Extended Data Fig. 6b. The control spaces for the intu- 
itive and perturbed BCI mappings in our experiments were spanned by the rows of 
M> for the intuitive mapping, M> wm for within-manifold perturbations and M2,.om 
for outside-manifold perturbations. Because we z-scored spike counts in advance, 
the control spaces for each day intersected at the origin of the neural space. The two 
principal angles** between the intuitive and perturbed control spaces defined the 
maximum and minimum angles of separation between the control spaces (Fig. 3b). 
Required preferred direction changes. This section corresponds to Fig. 3c and 
Extended Data Fig. 6c. One way in which learning is manifested is by changes in 
how individual neurons are tuned to the parameters of the movement, in particu- 
lar the preferred direction”"*. For each session, we sought to compute the required 
changes in preferred direction for each neural unit that would lead to proficient 
control of the cursor under the perturbed mapping. One possibility would be to exam- 
ine the columns of M; and M; p. Each column can be thought of as representing the 
pushing direction and pushing magnitude of one unit (that is, the contribution of 
each neural unit to the velocity of the cursor). We could simply estimate the required 
change in preferred direction by measuring the change in pushing directions for 
each unit between the intuitive and perturbed mappings. However, this method is 
not suitable for the following reason. For outside-manifold perturbations for mon- 
key J, we permuted only a subset of the neural units. Asa result, the columns of M> om 
corresponding to the non-permuted units were the same as in M). By estimating the 
required changed in preferred direction as the difference in directional components 
of M2 and M2 om, we would be implicitly assuming that the monkey is capable of 
identifying which units we perturbed and changing only their preferred directions, 
which appears to be difficult to achieve in the timeframe of a few hours’. Therefore, 
we sought a more biologically plausible method of computing the required preferred 
direction changes. 

Using a minimal set of assumptions, we computed the firing rates that each unit 
should show under one particular learning strategy. Then, we computed the preferred 
direction of each unit using those firing rates and compared them to the preferred 
directions during the baseline block. The following were the assumptions used to 
compute the firing rates: 

1. We assumed the monkeys would intend to move the cursor to each target at 
the same velocity it exhibited under the intuitive mapping. Fitts’ Law predicts that 
movement speed depends on movement amplitude and target size*’, and these were 
always the same in our experiments. 

2. The firing rates for the perturbed mapping should be as close as possible to the 
firing rates we recorded when the monkeys used the intuitive mapping. This keeps 
the predicted firing rates within a physiological range and implies a plausible explo- 
ration strategy in neural space. 

We used the following procedure to compute the required preferred direction 
changes. First, we found the average normalized spike count vector ui, across time 
points (300-1,000 ms after the start of the trial) and all trials to each target (i) dur- 
ing the baseline blocks. We minimized the Euclidian distance between ui, and ui, 
the normalized spike count vector for the perturbed mapping (assumption 2), sub- 
ject to Mui, = Mp pu’, (assumption 1). Mz ui, (the open-loop velocity for the intuitive 
mapping) is known from the baseline block. For a given perturbed mapping (with 
M),»), we sought to find u’, that would lead to the same open-loop velocity, which 
has a closed-form solution: 


(21) 


For each neural unit (k), we computed its preferred direction 03 (k) with the intu- 
itive mapping by fitting a standard cosine tuning model. 


ui, (k) =m: cos(0; — Og(k)) + bx 


. F -1 7 
ui, =ui, +ME, (MMI ») (Mz — Mp») u‘, 


(22) 


where ui,(k) is the kth element of u‘,, m, is the depth of modulation, b; is the model 
offset of unit k, and 0; is the direction of the ith target. We also computed the pre- 
ferred direction of each unit for the perturbed mapping (0p(k)) in the same way. 
Figure 3c shows histograms of 


|Op(k) — On(k)| 


averaged across all units for each session. 

Estimation of intrinsic dimensionality. This section accompanies Fig. 4a-c. 
During all experiments, we identified a ten-dimensional intrinsic manifold (that is, 
ten factors). Offline, we confirmed this was a reasonable choice by estimating the 
intrinsic dimensionality of the data recorded in each calibration block. For each day, 


(23) 
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we performed a standard model-selection procedure to compare factor-analysis 
models with dimensionalities ranging from 2 to 30. For each candidate dimension- 
ality, we used fourfold cross-validation. For each fold, we estimated the factor- 
analysis model parameters using 75% of the calibration data. We then computed 
the likelihood of the remaining 25% of the calibration data with the factor-analysis 
model. For each dimensionality, we averaged the likelihoods across all folds. Each 
day’s ‘intrinsic dimensionality’ was defined as the dimensionality corresponding to 
the largest cross-validated data likelihood of the calibration data for that day. 
Measuring the cumulative shared variance explained. This section corresponds 
to Fig. 4d. Factor analysis partitions the sample covariance of the population activity 
(cov(u)) into a shared component (AA‘) and an independent component (vj). In 
offline analyses, we sought to characterize the amount of shared variance along 
orthogonal directions within the intrinsic manifold (akin to measuring the lengths 
of the major and minor axes of an ellipse). These shared variance values are given 
by the eigenvalues of AA”, which can be ordered from largest to smallest. Each eigen- 
value corresponds to an ‘orthonormalized latent dimension’, which refers to iden- 
tifying orthonormal axes that span the intrinsic manifold. Each orthonormalized 
dimension is a linear combination of the original ten dimensions. The cumulative 
shared variance curve is thus informative of how ‘oblong’ the shared variance is within 
the manifold, and it can be compared across days. By definition, the cumulative 
shared variance explained reaches 100% using all ten dimensions, and none of the 
independent variance (w) is explained by those latent dimensions. 

Blinding. Investigator blinding was ensured because all sessions were analysed in 
the same way, by the same computer program. This parallel and automatic treatment 
of the two perturbation types eliminated investigator biases. The animals were blinded 
to the test condition delivered each day. If the animals knew which of the two con- 
ditions they were presented with, that might have biased our findings. Blinding was 
achieved before-the-fact with a random and/or unpredictable ordering of experi- 
ments, and after-the-fact with control analyses to ensure that conditions were matched 
as closely as we could detect. 


Statistics. For the histograms in Figs 2d and 3, Extended Data Figs 1b, 2 and 7a, the 
significances of the differences in distributions between within-manifold perturba- 
tion samples and outside-manifold perturbation samples were determined with two- 
tailed Student’s t-tests assuming unequal variances of the two samples. We ensured 
that each histogram followed a normal distribution (Kolmogorov-Smirnoyv test). In 
Extended Data Figs 1a and 3, the histograms did not follow a normal distribution 
(Kolmogorov-Smirnov test). For those figures, we used the Wilcoxon rank-sum test 
to determine the significance of the difference in the distributions. For the linear 
regressions in Fig. 4 and Extended Data Figs 4 and 6, we determined the significance 
level of the slopes being different from 0 using F-tests for linear regression. We 
determined whether the difference between two slopes was significant using two- 
tailed Student’s t-tests. For all tests, we used P = 0.05 as the significance threshold. 
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Extended Data Figure 1 | Performance during baseline blocks. 

a, Histograms of success rate during the baseline blocks on days when the 
perturbation would later be within-manifold (red) and outside-manifold 
(blue) for monkey J (top) and monkey L (bottom). For days with multiple 
perturbation sessions, the data are coloured according to the first perturbation 
type. Dashed lines, means of distributions; solid lines, mean + s.e.m. 

b, Histograms of target acquisition time during baseline blocks. Number of 
days for panels a and b: within-manifold perturbations, n = 27 (monkey J), 
14 (monkey L); outside-manifold perturbations, n = 31 (monkey J), 


0.8 1.0 


Acquisition time (s) 


14 (monkey L). c, Sample cursor trajectories to all eight targets. At the 
beginning of each day, the monkeys used the intuitive mapping for 250-400 
trials. The monkeys were able to use these mappings to control the cursor 
proficiently from the outset (as measured by success rate and acquisition time). 
On all sessions, the success rates were near 100%, and the acquisition times were 
between 800 and 1,000 ms. No performance metrics during the baseline blocks 
were significantly different between within-manifold perturbation sessions 
and outside-manifold perturbation sessions (P > 0.05; success rate, Wilcoxon 
rank-sum test; acquisition time, two-tailed Student’s t-test). 
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Extended Data Figure 2 | Changes in success rate and acquisition time 
during perturbation blocks. In Fig. 2d, we quantified the amount of learning 
in each session using a single metric that combined improvements in success 
rate and acquisition time. Here, we consider each metric separately. In each 
comparison, better performance is to the right. a, Change in success rate 
from the first 50-trial bin in the perturbation block to the bin with the best 
performance. The change in success rate was significantly greater for within- 
manifold perturbations than for outside-manifold perturbations for monkey J 
(top, P< 103, t-test). For monkey L (bottom), the change in success rate was 
greater for within-manifold perturbations than for outside-manifold 
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perturbations, and the difference approached significance (P = 0.088, t-test). 
b, Change in acquisition time from the first 50-trial bin in the perturbation 
block to the bin with the best performance. For both monkeys, the change in 
acquisition time for within-manifold perturbations was significantly greater 
than for outside-manifold perturbations (monkey J (top), P< 10“, t-test; 
monkey L (bottom), P = 0.0014, t-test). Note that a negative acquisition time 
change indicates performance improvement (that is, targets were acquired 
faster). Number of within-manifold perturbations, n = 28 (monkey J), 

14 (monkey L); outside-manifold perturbations, n = 39 (monkey J), 

15 (monkey L). 
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Extended Data Figure 3 | After-effects during washout blocks. After 600 
(monkey J) or 400 (monkey L) trials using the perturbed mapping, we 
re-introduced the intuitive mapping to observe any after-effects of learning. 
We measured the after-effect as the size of the performance impairment at 
the beginning of the washout block in the same way that we measured the 
performance impairment at the beginning of the perturbation block. 

A larger after-effect indicates more learning had occurred in response to the 
perturbation. For monkey J (left), the after-effect was significantly larger for 


| | 
4 


1 2 3 
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within-manifold perturbations (red) than for outside-manifold perturbations 
(blue) (Wilcoxon rank-sum test, P< 107°). For monkey L (right), the trend is 
in the same direction as monkey J, but the effect did not achieve significance 
(Wilcoxon rank-sum test, P > 0.05). These data are consistent with the 
hypothesis that relatively little learning occurred during the outside-manifold 
perturbations in comparison to the within-manifold perturbations. Number of 
within-manifold perturbations, n = 27 (monkey J), 14 (monkey L); outside- 
manifold perturbations, n = 33 (monkey J), 15 (monkey L). 
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Extended Data Figure 4 | Learning did not improve over sessions. It might 
have been that, over the course of weeks and months, the animals improved 
at learning to use perturbed mappings, either one type or both types together. 
This did not occur. Within-manifold perturbations showed more learning than 
outside-manifold perturbations across the duration of experiments. Animals 


did not get better at learning to use either type of perturbation separately 
(red and blue regression lines, F-test, P > 0.05 for all relationships) nor when 
considering all sessions together (black regression line, F-test for linear 
regression, P > 0.05). Same number of sessions as in Extended Data Fig. 2. 
Each point corresponds to one session. 
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Extended Data Figure 5 | Hand speeds during BCI control and hand 

control. We loosely restrained the monkeys’ arms to the chair’s armrests 
during experiments. The monkeys minimally moved their hands, but the 
movements did not approach the limits of the restraints. a, Average hand 
speeds across all trials in all sessions for the baseline blocks (left column), 
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within-manifold perturbation blocks (middle column), and outside-manifold 
perturbation blocks (right column) for monkey J (top row) and monkey L 
(bottom row). b, Average hand speed during a typical point-to-point reaching 
task (monkey L). Thus, the hand movements for the BCI tasks are substantially 
smaller than for the reaching task. 
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Extended Data Figure 6 | Accounting for within-class differences in 
learning. a, Relation between amount of learning and initial impairment in 
performance for monkey J (top) and monkey L (bottom). Each point 
corresponds to one session. Lines are linear regressions for the within-manifold 
perturbations and outside-manifold perturbations. *Slope significantly 
different than 0 (F-test for linear regression, P < 0.05). b, Relation between 
amount of learning and mean principal angles between control spaces for 
perturbed and intuitive mappings. c, Relation between amount of learning and 
mean required preferred direction (PD) change. Same number of sessions as in 
Extended Data Fig. 2. Figure 3 showed that the properties of the perturbed 
mappings (other than whether their control spaces were within or outside the 
intrinsic manifold) could not account for differences in learning between the 
two types of perturbation. However, as is evident in Fig. 2d, within each type of 
perturbation, there was a range in the amount of learning, including some 
outside-manifold perturbations that were learnable*’. In this figure, we 
examined whether learning within each perturbation type could be accounted 
for by considering other properties of the perturbed mapping. We regressed the 


amount of learning within each perturbation type against the various properties 
we considered in Fig. 3. Panel a shows the initial performance impairment 
could explain a portion of the variability of learning within both classes of 
perturbation for monkey J. That monkey showed more learning on sessions 
when the initial performance impairment was larger. For monkey L, the initial 
performance impairment could account for a portion of the within-class 
variation in learning only for outside-manifold perturbations; this monkey 
showed less learning when the initial performance impairment was larger. We 
speculate that monkey J was motivated by more difficult perturbations while 
monkey L could be frustrated by more difficult perturbations. Panel b shows 
that the mean principal angles between control planes were related to learning 
within each class of perturbation for monkey L only. Larger mean principal 
angles between the control planes led to less learning. Panel c shows that the 
required PD changes were not related to learning for either type of perturbation 
for both monkeys. This makes the important point that we were unable to 
account for the amount of learning by studying each neural unit individually. 
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Extended Data Figure 7 | Offline analyses of intrinsic manifold properties. 
a, The intrinsic dimensionalities for all sessions for monkey J (left) and 
monkey L (right). For both monkeys, the intrinsic dimensionalities were not 
significantly different between days when we performed within-manifold 
perturbations and days when we performed outside-manifold perturbations 
(t-test, P > 0.05). Dashed lines, means of distributions; solid lines, 

mean ~ s.e.m. Same number of days as in Extended Data Fig. 1. b, Relation 
between intrinsic dimensionality and the number of data points used to 
compute intrinsic dimensionality. For each of 5 days (one curve per day), we 
computed the intrinsic dimensionality using 25%, 50%, 75% and 100% of the 
total number of data points recorded during the calibration block. As the 
number of data points increased, our estimate of the intrinsic dimensionality 
increased in a saturating manner. c, Tuning of the raw factors. These plots 
exhibit the factors that were shuffled during within-manifold perturbations. 
We show for one typical day the average factors (2) corresponding to the ten 
dimensions of the intrinsic manifold over a time interval of 700 ms beginning 
300 ms after the start of every trial. Within each row, the coloured bars indicate 
the mean + standard deviation of the factors for each target. The line in 

each circular inset indicates the axis of ‘preferred’ and ‘null’ directions of the 
factor. The length of the axis indicates the relative depth of modulation. The 
tuning is along an axis (rather than in a single direction) because the sign of 
a given factor is arbitrary. d, Tuning of the orthonormalized factors. Same 
session and plotting format as c. The orthonormalized dimensions are ordered 
by the amount of shared variance explained, which can be seen by the variance 
of the factors across all targets. Note that the axes of greatest variation are 
separated by approximately 90° for orthonormalized dimensions 1 and 2. 
This property was typical across days. The retrospective estimate of intrinsic 
dimensionality (Fig. 4 and Extended Data Fig. 7a) may depend on the richness 
of the behavioural task, the size of the training set (Extended Data Fig. 7b), 


Factor value (2) 


the number of neurons, the dimensionality reduction method and the criterion 
for assessing dimensionality. Thus, the estimated intrinsic dimensionality 
should only be interpreted in the context of these choices, rather than in 
absolute terms. The key to the success of this experiment was capturing the 
prominent patterns by which the neural units co-modulate. As shown in 

Fig. 4d, the top several dimensions capture the majority of the shared variance. 
Thus, we believe that our main results are robust to the precise number of 
dimensions used during the experiment. Namely, the effects would have 
been similar as long as we had identified at least a small handful of dimensions. 
Given the relative simplicity of the BCI and observation tasks, our estimated 
intrinsic dimensionality is probably an underestimate (that is, a richer task 
may have revealed a larger set of co-modulation patterns that the circuit is 
capable of expressing). Even so, our results suggest that the intrinsic manifold 
estimated in the present study already captures some of the key constraints 
imposed by the underlying neural circuitry. The probable underestimate of 
the ‘true’ intrinsic dimensionality may explain why a few nominal outside- 
manifold perturbations were readily learnable (Fig. 2d). It is worth noting that 
improperly estimating the intrinsic dimensionality would only have weakened 
the main result. If we had overestimated the dimensionality, then some of 
the ostensible within-manifold perturbations would actually have been outside- 
manifold perturbations. In this case, the amount of learning would tend to 
be erroneously low for nominal within-manifold perturbations. If we had 
underestimated the dimensionality, then some of the ostensible outside- 
manifold perturbations would actually have been within-manifold 
perturbations. In this case, the amount of learning would tend to be erroneously 
high for outside-manifold perturbations. Both types of estimation error would 
have decreased the measured difference in the amount of learning between 
within-manifold perturbation and outside-manifold perturbations. 
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Processing properties of ON and OFF pathways for 
Drosophila motion detection 


Rudy Behnia’, Damon A. Clark??, Adam G. Carter*, Thomas R. Clandinin’ & Claude Desplan’? 


The algorithms and neural circuits that process spatio-temporal changes 
in luminance to extract visual motion cues have been the focus of 
intense research. An influential model, the Hassenstein—Reichardt 
correlator’, relies on differential temporal filtering of two spatially 
separated input channels, delaying one input signal with respect to 
the other. Motion in a particular direction causes these delayed and 
non-delayed luminance signals to arrive simultaneously at a subse- 
quent processing step in the brain; these signals are then nonlinearly 
amplified to produce a direction-selective response. Recent work in 
Drosophilahas identified two parallel pathways that selectively respond 
to either moving light or dark edges”’. Each of these pathways requires 
two critical processing steps to be applied to incoming signals: differ- 
ential delay between the spatial input channels, and distinct processing 
of brightness increment and decrement signals. Here we demonstrate, 
using in vivo patch-clamp recordings, that four medulla neurons imple- 
ment these two processing steps. The neurons Mil and Tm3 respond 
selectively to brightness increments, with the response of Mil delayed 
relative to Tm3. Conversely, Tm1 and Tm2 respond selectively to 
brightness decrements, with the response of Tm1 delayed compared 
with Tm2. Remarkably, constraining Hassenstein-Reichardt corre- 
lator models using these measurements produces outputs consistent 
with previously measured properties of motion detectors, including 
temporal frequency tuning and specificity for light versus dark edges. 
We propose that Mil and Tm3 perform critical processing of the 
delayed and non-delayed input channels of the correlator responsible 
for the detection of light edges, while Tm1 and Tm2 play analogous 
roles in the detection of moving dark edges. Our data show that spe- 
cific medulla neurons possess response properties that allow them 
to implement the algorithmic steps that precede the correlative oper- 
ation in the Hassenstein-Reichardt correlator, revealing elements of 
the long-sought neural substrates of motion detection in the fly. 
Behavioural and electrophysiological studies in flies have demonstrated 
that visual motion responses display the fundamental signatures pre- 
dicted by the Hassenstein—Reichardt correlator (HRC)'*”. In Drosophila, 
photoreceptors R1-R6 are required for motion detection, and synapse 
onto three lamina monopolar cells L1, L2 and L3 (ref. 6), which provide 
inputs to distinct motion pathways”*’"". L1 feeds into pathways involved 
in detecting moving light edges** while L2, with contributions from L1 
and L3, is involved in detecting moving dark edges**"° (Fig. 1b). Deeper 
in the optic lobe, two direction-selective neuronal types, T4 and T5, are 
also differentially tuned: T4 cells respond to moving light edges while 
T5 cells respond to moving dark edges’*. Both T4 and T5 are required 
for downstream, direction-selective responses of the visual system out- 
put cells called lobula plate tangential cells'*"*. According to the HRC 
model, these light and dark edge direction-selective pathways each require 
two processing steps: differential temporal delay and nonlinear amplifica- 
tion (Fig. 1a). Moreover, these two pathways must process changes in 
contrast differently to respond differentially to light and dark edges. One 
such asymmetric mechanism is ‘half-wave rectification’, where inputs of one 
polarity are amplified and inputs of the opposite polarity are suppressed. 


Since L1 and L2 relay information about both contrast increments 
and decrements’ (they hyperpolarize in response to light increments 
and depolarize in response to decrements) and act as inputs to motion 
pathways, we focused on medulla neurons that link L1 and L2 to T4 and 
T5 to identify potential sites of half-wave rectification and delay. Elec- 
tron microscopic reconstruction of the medulla has identified columnar 
neurons types Tm3 and Mil as receiving the large majority of synapses 
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Figure 1 | Motion detection and the fly optic lobe. a, A half HRC sensitive to 
rightward motion. An object moving from left to right first activates input 1 
and then input 2. The signal from input 1 is delayed (t) and arrives at the 
correlation stage (M for multiplication) close in time to the signal from unit 2, 
nonlinearly enhancing the signal. For leftward motion, the signals are separated 
in time by the delay and no motion signal is generated. In the full correlator 
model, two mirror symmetric correlators are subtracted, producing responses 
that have opposite signs for opposite directions (see Fig. 4a). b, Light edge (L1) 
and dark edge (L2) motion-sensitive pathways in the Drosophila optic lobe. 
L1 and L2 lamina monopolar cells in the lamina provide inputs to two distinct 
motion-sensitive pathways that selectively respond to moving light edges and 
dark edges, respectively. L1 and L3 also contribute to the pathway detecting 
moving dark edge (not shown). T4 and T5 in the lobula complex are the main 
inputs to lobula plate tangential cells (LPTCs), and are themselves direction 
selective. T4 neurons respond selectively to moving light edges and T5 neurons 
respond to moving dark edges. Mil and Tm3 are the main postsynaptic targets 
of L1 while Tm1 and Tm2 are the main postsynaptic targets of L2. The 
axons of Mil and Tm3 contact T4 in the most proximal medulla layer, whereas 
Tm1 and Tm2 contact T5 dendrites in superficial lobula layers (modified from 
ref. 30). c, In vivo electrophysiology set up: a window is cut in a dorsal 

region of the head cuticle of an immobilized live fly to expose the cell bodies of 
medulla neurons to a glass pipette used to perform the recordings. Grey-scale 
images are displayed on a screen positioned in front of the fly, using a digital 
light projector coupled to a coherent fibre optic bundle. 
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from LI (refs 15, 16). Similarly, Tm1 and Tm2 are the main synaptic tar- 
gets of L2 (refs 15, 16). Furthermore, Mil and Tm3 together constitute 
over 80% of the presynaptic inputs to T4 (ref. 16) and both Tm1 and 
Tm2 provide input to T5 (ref. 17). Based on their innervation patterns, 
Mil and Tm3 have been proposed to be core components of the motion 
detector for light edges involving T4 (ref. 16). Similarly, Tm1 and Tm2 are 
likely candidates for analogous roles in a motion detector for dark edges. 

Since changes in luminance are central to motion detection, we first 
examined the responses of Mil, Tm3, Tm1 and Tm2 to step changes in 
light intensity by performing whole-cell current-clamp recordings on 
awake immobilized fruit flies (Fig. 1c). Both Mil and Tm3 responded 
with a strong, transient depolarization at the onset ofa 1 s light step, and 
then transiently hyperpolarized to below pre-stimulus levels at light 
offset (Fig. 2a right and Extended Data Fig. 1a). The responses to onset 
and offset of light differed in magnitude: in Mil, the offset hyperpolar- 
ization amplitude was only 11% (s.e.m. = 3.5%) of the onset depolari- 
zation, while in Tm3 this fraction was 36.6% (s.e.m. = 7.1%) (Extended 
Data Fig. 1c). A brief flash of light (200 ms) elicited a sharper depolar- 
ization in both cells, with the offset hyperpolarization terminating the 
depolarization phase of the response (Fig. 2a left). The responses observed 
in Tm1 and Tm2 were similar to each other, yet were strikingly different 
from those in Mil and Tm3 (Fig. 2b and Extended Data Fig. 1b). Tm1 
and Tm2 hyperpolarized at light onset, and depolarized strongly at light 
offset. The hyperpolarization of Tm1 evoked by stimulus onset was 26.1% 
(s.e.m. = 3.8%) as large as the depolarization evoked at offset; for Tm2, 
this number was 17.7% (s.e.m. = 2.3%) (Extended Data Fig. 1c). Finally, 
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Figure 2 | Mil/Tm3 respond selectively to brightness increments while 
Tm1/Tm2 respond selectively to brightness decrements. a, Average evoked 
responses of Mil (n = 7) and Tm3 (n = 10) in the L1 pathway, in response to 
200 ms, four consecutive 500 ms and 1s full-field flashes of light from dark. 
Thick lines indicate the mean; shaded region indicates s.e.m. b, Averaged 
evoked responses (+ s.e.m.) of Tm1 (m = 10) and Tm2 (n = 11) in the L2 
pathway in response to the same stimuli. 
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rapid sequential presentations of light caused repolarization of these 
cells while their membrane potential was still peaking or decaying from 
a previous flash (Fig. 2b middle). Thus, Mil and Tm3, the postsynaptic 
targets of L1, respond mostly to brightness increments. Conversely, Tm1 
and Tm2, the postsynaptic targets of L2, respond most strongly to bright- 
ness decrements, consistent with calcium imaging studies of Tm2 (ref. 18). 
All four cells showed asymmetries in their responses to brightness changes, 
consistent with a role in conferring edge selectivity to each pathway. In 
addition, we examined whether these medulla neurons could relay long- 
term information about contrast to downstream circuitry by charac- 
terizing responses evoked by 5 s brightness increments or decrements 
presented on an intermediate grey background (Extended Data Fig. 2a—d). 
All four neurons displayed a sustained response for both brightness incre- 
ments and decrements, consistent with observations that motion res- 
ponses can be evoked even when a sequential change in luminance at 
two points in space occurs with a delay period of up to 10s in experi- 
ments using apparent motion stimuli*”. 

In HRC models, the individual inputs to the cells that perform the 
nonlinear amplification step are not themselves direction selective. We 
therefore tested the responses of Mil, Tm3, Tm1 and Tm2 to motion 
stimuli, using light and dark bars moving in different directions, under 
conditions that evoke strong responses from lobula plate tangential cells”” 
(Extended Data Fig. 3). All four neurons responded to moving bars with 
a sharp depolarization (Extended Data Fig. 3a, b) but the amplitude of 
these responses was independent of the direction of motion (Extended 
Data Fig. 3c). Thus, Mil, Tm3, Tm1 and Tm2 are not direction selective 
under these conditions, consistent with these cells acting upstream of the 
nonlinear correlation stage of motion detection, as recently reported for 
Tm2 (ref. 18). 

We next examined whether Mil and Tm3, or Tm1 and Tm2, have 
different response latencies that would allow them to differentially delay 
responses to contrast changes. To quantitatively capture the responses 
of these neurons to dynamic stimuli spanning a wide range of contrast 
values and time-scales, we used an approximately Gaussian-distributed 
random flicker stimulus with a 50% contrast (standard deviation) and 
an exponential correlation time of 10 ms (see Methods). We used stan- 
dard procedures to extract the linear filter that best described the tem- 
poral properties of the response”’”” (see Methods). The responses of Mil 
and Tm3 to the noise stimuli were very similar (r = 0.91 between mean 
response traces), with temporal filters that comprised a large positive 
lobe reflecting a sign-conserving relationship between the contrast input 
and the neural response (Fig. 3a, b left (arrow) and Extended Data Figs 4a 
and 5a, b). Mean Tm1 and Tm2 responses to these noise stimuli were 
also similar to one another (r = 0.90), with temporal filters that included 
a large negative lobe, reflecting a sign-inversion between the contrast 
input and the neural response (Fig. 3d, e left, arrow; Extended Data Figs 4b 
and 5e, g). For Mil the average peak response time was 71 ms after a 
contrast change (s.e.m. = 3.8 ms) while it was 53 ms (s.e.m. = 5.2 ms) 
for Tm3. Thus, a difference of 18 ms existed between the peak times of 
the filters for Mil and Tm3 (Fig. 3b right; P< 0.01). Similarly, the average 
peak time was 56 ms (s.e.m. = 3.8 ms) for [m1 and 43 ms (s.e.m. = 2.7 ms) 
for Tm2. The difference in latency between the two cells was 13 ms (Fig. 3e 
right; P< 0.002). Thus, in both cases, there was a small but significant 
temporal offset, with Mil exhibiting a delayed response compared with 
Tm3, and Tm1 being delayed relative to Tm2. Notably, these peak delay 
differences are not much smaller than delays inferred from some lobula 
plate tangential cells recordings and behavioural experiments*”*™. 

We next asked whether neuronal responses to this stochastic stimulus 
were linear or whether different gains were applied to brightness incre- 
ments and decrements. The noise stimulus was convolved with the cor- 
responding filters for each neuron type to obtain the predicted linear 
response of each neuron. We then compared the linear predictions with 
the actual response to define the instantaneous nonlinearity for each neu- 
ron. Consistent with the light step results, the nonlinearities extracted for 
Mil and Tm3 revealed that these cells respond more strongly to brightness 
increments than to decrements (Fig. 3c and Extended Data Fig. 5b, d). 
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Figure 3 | Mil/Tm3 and Tm1/Tm2 respond with different delays and 
nonlinearities to a Gaussian noise stimulus. a, Top: 2 s excerpt of the intensity 
signal from a 10 s full-field Gaussian noise stimulus. Signal correlation time was 
10 ms (see Methods). Middle: average voltage response (+ s.e.m.) of Mil 

(n = 7) and Tm3 (n= 11) to the 2s noise stimulus on top. The black trace 
corresponds to the average predicted linear response (+ s.e.m.) obtained by 
convolving the stimulus with the filters in (b). Bottom: overlay of the Mil and 
Tm3 responses showing the high similarity in their response. b, Left: average 
linear filters extracted from the data in a that best predict the measured 
response of Mil and Tm3 as a function of preceding light intensity changes 
(+ s.e.m.). The filters comprise a large positive lobe (arrow) and shallow 
negative lobe (see Extended Data Fig. 5). Right: box plots of the distribution of 
the timing of the peak responses of the Mil and Tm3 neurons. There is, on 
average, an 18 ms delay between the peak of Mil filters and Tm3 filters. Black 
line is the median, coloured line is the average. c, Average actual responses of 
Mil and Tm3 plotted against their average linear predicted responses. Error 
bars, s.e.m. A line of slope 1 is shown in black. d-f, Same as above for Tm1 
(n = 15) and Tm2 (n = 14). The filters comprise a large negative lobe (arrow) 
and a shallow positive lobe (Extended Data Fig. 5). The average peak delay 
between the peak of Tm1 and Tm? filters is 13 ms. 


Similarly, both Tm1 and Tm2 neurons were less hyperpolarized in res- 
ponse to brightness increments and more depolarized in response to 
brightness decrements than predicted by the linear model (Fig. 3f and 
Extended Data Fig. 5f, h). The noise stimuli evoked smaller response asym- 
metries than those observed with brightness steps, possibly because these 
stimuli use smaller changes in intensity than our step stimuli. Such dif- 
ferences in gain for brightness increments and decrements reflect par- 
tial half-wave rectification, a central feature of models that selectively 
respond to one contrast polarity**"°*. 

Can the dynamics of the linear filters and the extent of the nonline- 
arities we measured account for well-characterized properties of motion 
detecting pathways? One hallmark of the HRC is that it displays a peak 
response to a defined temporal frequency, creating a temporal frequency 
optimum. Because of its structure, the output of an HRC is not propor- 
tional to the speed of motion, but rather increases to a maximum value, 
before decaying at faster speeds. The shape of this tuning curve depends 
on the temporal properties of its two input channels”*. We constructed 
two separate model correlators, one that used Mil and Tm3 filters and 
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nonlinearities as the two channels preceding multiplication and sub- 
traction, and a second one that used Tm1 and Tm? filters and nonline- 
arities (Fig. 4a and data from Extended Data Fig. 6). We examined whether 
these model motion detectors produced temporal frequency tuning curves 
similar to those previously measured in flies. When we presented these 
models with moving sine waves of 20% contrast at various contrast fre- 
quencies, we observed a peak response at approximately 1 Hz for both 
the Mil/Tm3 and the Tm1/Tm2 models (Fig. 4b). This computed tem- 
poral frequency optimum compares favourably with the optima mea- 
sured in blowflies and Drosophila’. 

Another measured feature of these two motion pathways is their selec- 
tivity for edges of particular contrast polarity. We presented our model 
correlators with light and dark edges of 100% contrast, moving across 
a grey background. The Tm1/Tm2 model was highly selective for dark 
edges over a range of speeds. The Mil/Tm3 model was only mildly selec- 
tive for light edges, owing to the more linear responses measured in Mil 
and 'Tm3 compared with Tm1 and Tm2 (Fig. 4c). These modelling results 
are consistent with experimentally measured high selectivity of the dark 
edge motion pathway”’, and a more modest selectivity of the light edge 
motion pathway’. 

Taken together, our data are consistent with a model in which Mil 
and Tm3 represent central components of the input channels of a cor- 
relator detecting moving light edges, while Tm1 and Tm2 represent anal- 
ogous components for a correlator that is tuned to detect moving dark 
edges. The asymmetric responses of these four neurons to brightness 
increments and decrements corroborates previous studies that argued 
for separate processing of ON and OFF inputs by distinct channels'*”° 
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Figure 4 | Modelling Mil/Tm3 and Tm1/Tm2 as the delayed and non- 
delayed channels of light edges and dark edges correlators. a, Left: the 
average filters and nonlinearities from the receptive field stochastic data set 
(200 s noise presentation, 50 ms correlation time, Extended Data Fig. 6) were 
used to model Mil and Tm3 as the delayed and non-delayed channels of a 
correlator model in bottom left. Right: same as left with Tm1 and Tm2 as 
the delayed and non-delayed channels of correlator model. b, Computed 
normalized temporal frequency tuning curves obtained numerically for the 
Mil/Tm3 model correlator (left) and the Tm1/Tm2 model correlator (right) 
using sine waves of different temporal frequencies. c, Computed normalized 
response of the Mil/Tm3 model correlator (left) and the Tm1/Tm2 model 
correlator (right) to light or dark edges of 100% contrast, moving at a range 
of speeds. 
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to explain the segregation between light and dark edge processing. This 
situation is similar to separate processing of ON and OFF signals by 
bipolar cells in the vertebrate retina”” 

The relative delays we measured between the peak responses in these 
cells is roughly ten times smaller than previously calculated for idealized 
motion detector models that fit a host of experimental data. In classic 
HRC models, input to one channel is not filtered, while input to the 
second channel is low-pass filtered with a time constant of t. In these 
models, the maximum response occurs at a temporal frequency of 1/211, 
so that the delay fora 1 Hz optimum is t ~ 150 ms”°. However, since both 
filters we measured act as band-pass filters, they suppress high-frequency 
inputs, while still producing delay differences between the channels. Thus, 
when input channels contain both these measured filters, a peak timing 
difference of ~15 ms can result in a temporal frequency optimum of 1 Hz 
(see Methods and Extended Data Fig. 7). Furthermore, two considera- 
tions might lengthen the actual relative delays between pathways. First, 
we performed somatic recordings that may only approximate the true 
axonal response of the neurons. Second, the synapses between Mil/Tm3 
and T4, and those between Tm1/Tm2 and T5 could impose additional 
delays to either input channel before a correlation operation. 

Anatomical reconstruction of the Drosophila medulla connected the 
predicted spatial receptive fields of Mil and Tm3 cells to the dendritic 
arbors of individual T4 cells with known directional preferences*. Accord- 
ing to predictions derived from that analysis, if Mil signals are delayed 
relative to those of Tm3, as our recordings indicate, the observed direc- 
tion selectivity in T4 could be accomplished by combining Mil and Tm3 
inputs with opposite signs onto T4 (one inhibitory and the other excit- 
atory). Such an arrangement could be similar to the motion detection 
model proposed to explain direction selective responses in the verte- 
brate retina’®”. 

Given the cellular and synaptic complexity of medulla circuitry, as 
well as the wealth of distinct behaviours that are guided by visual motion, 
additional cell types are likely to play computational roles in Drosophila 
elementary motion detectors. Nonetheless, our data show that Mil, Tm3, 
Tm1 and Tm2 possess response properties that are consistent with imple- 
menting the algorithmic steps that precede the correlation operations 
in the motion detecting pathways in Drosophila. 


METHODS SUMMARY 


Green fluorescent protein (GFP)-targeted recordings were performed as described 
previously” under visual control using an Olympus microscope and a X60 objective. 
Electrophysiology data were collected using Igor (Wavemetrics) running mafPC 
(courtesy of M. A. Xu-Friedman). 

The drivers 686b-Gal4 and Bsh-Gal4 were used to target Mil neurons, R13E12- 
Gal4 for Tm3, 27b-Gal4 for Tm1, otd-Gal4 for Tm2 (see Methods for details about 
Gal4 driver lines). A cytosolic variant of UAS-GFP (a gift from G. Turner) was expressed 
under the control of the drivers to visualize the neurons. 

Visual stimulation used a digital light projector coupled to a coherent fibre optic 
bundle to project images on a screen as described previously’. 

Data were analysed using custom-written Matlab code. Filter extraction used stan- 
dard methods”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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miR-34a blocks osteoporosis and bone metastasis by 
inhibiting osteoclastogenesis and Tgif2 


Jing Y. Krzeszinski', Wei Wei', HoangDinh Huynh’, Zixue Jin', Xunde Wang’, Tsung-Cheng Chang”, Xian-Jin Xie*, Lin He’, 
Lingegowda S. Mangala®’, Gabriel Lopez-Berestein”*, Anil K. Sood®”°, Joshua T. Mendell*? & Yihong Wan'? 


Bone-resorbing osteoclasts significantly contribute to osteoporosis 
and bone metastases of cancer’. MicroRNAs play important roles 
in physiology and disease**, and present tremendous therapeutic 
potential®. Nonetheless, how microRNAs regulate skeletal biology 
is underexplored. Here we identify miR-34a as a novel and critical 
suppressor of osteoclastogenesis, bone resorption and the bone me- 
tastatic niche. miR-34a is downregulated during osteoclast differ- 
entiation. Osteoclastic miR-34a-overexpressing transgenic mice exhib- 
it lower bone resorption and higher bone mass. Conversely, miR-34a 
knockout and heterozygous mice exhibit elevated bone resorption 
and reduced bone mass. Consequently, ovariectomy-induced osteopo- 
rosis, as well as bone metastasis of breast and skin cancers, are dimin- 
ished in osteoclastic miR-34a transgenic mice, and can be effectively 
attenuated by miR-34a nanoparticle treatment. Mechanistically, we 
identify transforming growth factor-B-induced factor 2 (Tgif2) as 
an essential direct miR-34a target that is pro-osteoclastogenic. Tgif2 
deletion reduces bone resorption and abolishes miR-34a regulation. 
Together, using mouse genetic, pharmacological and disease mod- 
els, we reveal miR-34a as a key osteoclast suppressor and a potential 
therapeutic strategy to confer skeletal protection and ameliorate bone 
metastasis of cancers. 

Weexamined the levels of several cancer-related microRNAs (miRNAs) 
during a time course of bone marrow osteoclastogenesis assay (Fig. 1a). 
While the expression of an osteoclast marker tartrate-resistant acid phos- 
phatase (TRAP) was rapidly increased by RANKL and further elevated by 
rosiglitazone”® (Fig. 1b), miR-34a was rapidly downregulated by RANKL 
and further diminished by rosiglitazone (Fig. 1c). The levels of miR-34b/c, 
two other members in the miR-34 family, were unaffected and expressed 
at much lower levels than miR-34a (Fig. 1d). 

The sequence of miR-34a is evolutionarily conserved and identical 
in mice and humans. Osteoclast differentiation both from mouse bone- 
marrow precursors (Fig. le, f) and from human peripheral blood mono- 
nuclear cells (hPBMN) (Fig. 1g—j) was inhibited by a miR-34a precursor 
(pre-miR-34a) but enhanced by an antisense miR-34a inhibitor (anti- 
miR-34a), indicating that miR-34a regulation of bone resorption in mice 
will probably translate to human pathophysiology. 

We generated osteoclastic miR-34a transgenic mice using CAG34a 
mice (Fig. 2a) and Tie2-cre mice’*. Fluorescence-activated cell sorting 
(FACS) and imaging showed that osteoclast progenitors from the 34a- 
Tie2-Tg (CAG34a* Cre) mice were converted to GFP* LacZ whereas 
the controls (CAG34a* Cre ) remained GEP’ LacZ* (Extended Data 
Fig. 1a, b). Northern blot confirmed the overexpression of mature-miR- 
34a in the bone marrow of 34a-Tie2-Tg mice (Extended Data Fig. 1c). 

Osteoclast differentiation assay reveals that the higher levels of mature 
miR-34a in the 34a-Tie2-Tg cultures resulted in a lower induction of 
osteoclast markers, diminished number/size of mature osteoclasts and 
reduced resorptive activity, whereas precursor proliferation or survival 


was unaltered (Fig. 2b and Extended Data Fig. 1d-g). Consequently, 
serum bone resorption marker CTX-1 (carboxy-terminal telopeptides 
of type I collagen) and osteoclast number were decreased, whereas oste- 
oblast number, bone formation rate and mineral apposition rate were 
unaltered (Fig. 2c and Extended Data Fig. 1h, i). 

Micro-computed tomography of the proximal tibiae showed that 34a- 
Tie2-Tg mice had increased bone mass and decreased structure model 
index (SMI), which quantifies the relative amount of plates (SMI = 0, 
strong) and rods (SMI = 3, fragile) (Fig. 2d, e). Cortical BV/TV was also 
higher (Fig. 2f). Moreover, miR-34a transgenic mice generated by three 
other osteoclast-targeting cre drivers also exhibited a similar phenotype 
(Extended Data Figs 2 and 3). Thus, miR-34a in the osteoclast lineage aug- 
ments bone mass by suppressing osteoclastogenesis and bone resorption. 

To determine whether miR-34a is a physiologically relevant regula- 
tor of bone resorption, we next examined miR-34a knockout (34a-KO) 
and heterozygous (34a-Het) mice (Fig. 2g). Northern blot confirmed 
the diminished levels of miR-34a in 34a-KO (Extended Data Fig. 4a). 
Consistent with prior reports”’®, miR-34a deletion had no overt effect 
on mouse development. Osteoclast differentiation was augmented in 
34a-Het and 34a-KO cultures, whereas precursor proliferation or sur- 
vival was unaffected (Fig. 2h and Extended Data Fig. 4b-e). Asa result, 
serum CTX-1 and osteoclast number were elevated (Fig. 2iand Extended 
Data Fig. 4g, h). Micro-computed tomography revealed that 34a-KO 
and 34a-Het mice exhibited a low bone mass with decreased connec- 
tivity density and increased SMI (Fig. 2j-l). Global miR-34a deletion also 
decreased bone formation as the serum marker P1NP (amino-terminal 
propeptide of type I procollagen), osteoblast number, bone formation 
rate and mineral apposition rate were reduced (Fig. 2m and Extended 
Data Fig. 4g, h). The increased resorption in 34a-Het indicates that miR- 
34a function is haploinsufficient and sensitive to dosage reduction. The 
recently published miR-34abc triple knockout (34abc-TKO)” and full 
miR-34a KO” also showed a similar phenotype (Extended Data Fig. 5a-h), 
which validates our miR-34a gene trap mice and strengthens the find- 
ing that miR-34a loss-of-function elevates bone resorption. 

Bone marrow transplantation showed that wild-type (WT) mice receiv- 
ing 34a-KO marrow also exhibited higher CTX-1 (Extended Data Fig. 4f) 
compared with WT mice receiving WT marrow. Furthermore, osteo- 
clastic miR-34a conditional knockout mice (34a-Tie2-KO) also exhibited 
elevated osteoclast differentiation and bone resorption, but unaltered bone 
formation, leading to a decreased bone mass (Extended Data Fig. 5i-n). 
Thus, miR-34a deletion in the osteoclast lineage elevates bone resorption. 

Our genetic findings prompted us to investigate whether pharmaco- 
logical administration of a miR-34a mimic can attenuate postmenopausal 
osteoporosis using an ovariectomy (OVX) mouse model and a chitosan 
(CH) nanoparticle vehicle. Reduction of uterine weight in all ovariecto- 
mized mice indicated effective oestrogen depletion (Fig. 3a). Unaltered body 
weight indicated the absence of obvious toxicity from CH nanoparticles 
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Figure 1 | miR-34a suppresses osteoclastogenesis ex vivo. a, Diagram of 
bone marrow osteoclast differentiation assay; rosi, rosiglitazone. b-d, TRAP 
expression (b) and mature miRNA levels (c, d) (n = 3). e, f, Osteoclast 
differentiation was decreased by pre-miR-34a (e) but increased by anti-miR- 
34a (f) (n = 3). Left, mature miR-34a levels; right, TRAP expression; bottom, 
images of TRAP-stained cultures; mature osteoclast numbers (black) and 


resorptive activity (blue). Scale bar, 25 um. V, vehicle; R, RANKL. 

g-j, Human RANKL-mediated osteoclast differentiation from hPBMN cells 
(n = 4). g, Mature miR-34a levels. h, TRAP expression. i. Mature osteoclast 
numbers. j. TRAP staining and resorptive activity. Scale bar, 25 jum. 

Error bars, s.d. *P < 0.05, **P < 0.01, ***P < 0.005, ****P < 0.001; 

NS, non-significant. 


e # 
Contiol J ucT WT 3da-Het 3da-KO 
1.5 < Control = 60 
Cy z= GS 34a-Tie2-Tg e a 
3 wwe £20 = 3 
« 1.0 £40 3 
E = = me 8 
g Ee 7 & ee 
2 1.0 : 
Ss o +r O 
@ 0.5 2 t5 220 
& 2 | ife 
oc s hk ieee 2 
iva NS aoe ao & 
0.0 0.0 25" o 2 
Vv R_ R+Rosi ° 5 
E 
Control CT 34a-Tie2-T. 80 8 
- 8 s = ee © Control € 
7 — 60 es = 0.04 @® 34a-Tie2-Tg 
5 E = 
$ & : 
> 40 s 
3 a 5 0.02 
£ mm 20 = k,, ow 
0.4, 34a-Het 
0 0.00 Bw 34a-kO ae 
f — 40 <—8 is 
600 7 0.6 2 ¢ eetae OE in 
€ 2.0 é S02 ye = 
£ 400 ~ i am 04 (a) Q 20 a4 
3 a g 2 
; 1.0 2 
5 200 5 02 0.0 t) 0 
8 Oo 
i) 0.0 0.0 
0.20 _ 9005 7 
AATAAA me: “ 
F : € bia E 2.0 
g Genomic Genomic S E 6004 cad = 
DNA = = # 
BNA 50.10 3 Fe 
Ext bke2 -— — BB oe = ann Dig ill 
el i 7 
wt | 34a-KO | Fe & 
Truncated 0.00 0 0.0 
Full-length Face 
pri-miR-34a AMA, pri-miR-34a II_ ES 
h i 80 I m 
e S$ |B yieret a wr 3da-Het 34a-KO £ oe 6 £30 
x 04 & 47 34a-KO a ER z > 60 ane 2 Ee is 
i i$ oD 
x a = £20 
F & se x 40 am 4 3 ae 
2 02 F 0.2 5 T 3 E40 
2 ~ = ea Pisce thd | Ears ae § 20 a E 
= 2 = [il 93+8 277419" = 292 + 25% 5 e 3 
0.0 0.0 sin 0.19+0.02 0.38+0.03* 0.42+0.02" @ 6 0.0 a 0 
Vv R R + Rosi 


Figure 2 | miR-34a inhibits bone resorption and increases bone mass in 
vivo. a, Diagram of the conditional miR-34a transgene (CAG-34a). b, 34a- 
Tie2-Tg cultures showed decreased osteoclast differentiation (n = 3). Left, 
miR-34a levels; middle, TRAP expression; right, TRAP staining, osteoclast 
numbers (black) and resorptive activity (blue). Scale bar, 25 um. c, Serum CTX- 
1 (2-month-old, male, n = 5). d-f, Micro-computed tomography (j1CT) of 
the tibiae (2-month-old, male, n = 4). d, Images of the trabecular bone of 
the tibial metaphysis (top) (scale bar, 10 jm) and the entire proximal tibia 
(bottom) (scale bar, 1 mm). e, Trabecular bone parameters. BV/TV, bone 
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volume/tissue volume ratio; BS/BV, bone surface/bone volume ratio; Tb. th., 
trabecular thickness; Tb. sp., trabecular separation; Conn. d., connectivity 
density. f, Cortical BV/TV. g, Diagram of miR-34a gene-trap knockout. h, 34a- 
KO and 34a-Het cultures showed enhanced osteoclast differentiation (n = 3). 
i, Serum CTX-1 (2-month-old, male, n = 6). j-I, Micro-computed tomography 
of the tibiae (2-month-old, male, n = 4). j, Images. k, Trabecular bone 
parameters. Tb. n., trabecular number. 1, Cortical BV/TV. m, Serum P1NP 
(2-month-old males, n = 6). Error bars, s.d. *P < 0.05, **P < 0.01, 

EP < 0.005, ****P < 0.001; NS, non-significant. 
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Figure 3 | miR-34a attenuates osteoporosis and cancer bone metastases. 
a-f, OVX or sham operation was performed on 10-week-old female mice. 
Three days after surgery, the OVX mice were treated with miR-34a-CH (34a) 
or miR-Ctrl-CH (Ctrl) at 5 ug per mouse twice a week for 5 weeks (n = 5). 

a, Uterine weight. b, Body weight. c, Serum CTX-1. d, Serum P1NP. 

e, Micro-computed tomography images. f, Trabecular bone parameters. 

g, miR-34a levels in each tissue from miR-34a-CH- vs. miR-Ctrl-CH-treated 
mice 72h after a single injection (n = 3). Top, mature miR-34a levels; bottom, 
fold induction. BM, bone marrow. h-k, 34a-Tie2-Tg mice or controls 
(3-month-old, female, n = 7) were subjected to OVX and analysed 5 weeks after 
surgery. h, Uterine weight. i, Serum CTX-1. j, Serum P1NP. k, BV/TV by 
micro-computed tomography. 1, Xenograft of MDA231-BoM-1833 cells into 
34a-Tie2-Tg (n = 8) or control (n = 9). m, Allograft of B16-F10 cells into 


(Fig. 3b). Compared with sham controls, OVX mice treated with miR- 
Ctrl-CH showed increased CTX-1 and decreased P1NP, whereas both 
effects were largely prevented in OVX mice treated with miR-34a-CH 
(Fig. 3c, d). Consequently, OVX-induced bone loss was attenuated by 
miR-34a-CH (Fig. 4e, fand Extended Data Fig. 6a). miR-34a-CH also 
decreased bone resorption and increased bone formation in sham con- 
trols, leading to a higher bone mass (Extended Data Fig. 6b-d). Biodis- 
tribution analysis showed that miR-34a level in the bone marrow was 
the highest, and further increased fivefold by miR-34a-CH, indicating 
an efficient miR-34a delivery (Fig. 3g). 

In addition to acute systemic miR-34a treatment, we also examined 
the effects of chronic osteoclastic miR-34a overexpression. OVX-induced 
bone resorption and bone loss were also attenuated in 34a-Tie2-Tg mice 
without altering OVX effects on bone formation (Fig. 3h-k and Extended 
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34a-Tie2-Tg (n = 4) or control (n = 6). n, MDA231-BoM-1833 cells in 34a-KO 
(n = 6), 34a-Het (n = 6) or control (n = 6). 0, B16-F10 in 34a-KO (n = 6), 34a- 
Het (n = 4) or control (n = 6). p-r, Bone metastasis of MDA231-BoM-1833 
cells was attenuated by miR-34a-CH delivered 3 days after xenograft at 10 pg 
per mouse twice a week for 5 weeks (1 = 5). p, Bioluminescence imaging (BLI) 
signal. q, Left, BLI images; right, number and size of bone metastases (met). 
r, X-ray images and histology images for TRAP and ALP (alkaline phosphatase) 
staining. Arrows, osteolytic lesions. s, Bone metastasis of B16-F10 cells was 
attenuated by miR-34a-CH delivered at 5 1g per mouse twice a week for 

4 weeks starting 1 week before cancer cell injection (n = 8). I-s, Statistical 
analyses used a Mann-Whitney U test and are shown as mean = s.d. with 

P values illustrated. a, c, d, f, h-I, n-p, P< 0.05 by analysis of variance. 

*P < 0.05, **P < 0.01, ***P < 0.005, ****P < 0.001; NS, non-significant. 


Data Fig. 6e). These results indicate that osteoclastic miR-34a overexpres- 
sion is sufficient to impede osteoporosis, and the osteoclast is a key site 
for miR-34a therapeutic benefit. 

To determine whether osteoclastic miR-34a confers protection from 
bone metastases, we employed two cancer-cell-cardiac-injection models. 
First, a human breast cancer cell line (MDA231-BoM-1833) was xenografted 
into female nude mice. This model allowed us to assess cancer cells from 
human. Second, a mouse melanoma cell line (B16-F10) was allografted 
into immunocompetent male mice. This model took consideration of 
adaptive immunity. In both models, bone metastases were attenuated in 
34a-Tie2-Tg and 34a-PT-Tg mice but exacerbated in 34a-KO and 34a-Het 
mice (Fig. 3l-o and Extended Data Fig. 7a—f). Because miR-34a remained 
intact in the exogenous cancer cells, the altered bone metastases resulted 
from the altered miR-34a in the bone microenvironment of the host. 
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Figure 4 | Tgif2 is an essential miR-34a direct target and a pro- 
osteoclastogenic factor. a, Tgif2 expression was inhibited by pre-miR-34a in 
osteoclast cultures (n = 3). b, Tgif2 expression in WT and 34a-PT-Tg osteoclast 
cultures (n = 3). c, Sequence alignment of the Tgif2 3’ UTR. d, A diagram 

of Tgif2 3’ UTR reporters. e, Luciferase readout from WT or mutant Tgif2 3’ 
UTR reporter co-transfected in HEK293 cells with pre-miR-34a or anti-miR- 
34a (n = 3). f-h, Comparison of Tgif2-KO, Tgif2-Het and WT control 

mice (1.5-month-old, male, n = 7). f, Serum CTX-1. g—h, Micro-computed 
tomography of tibiae. g, Trabecular BV/TV. h, Images of the trabecular bone of 
the tibial metaphysis (scale bar, 10 um). i, Decreased osteoclast differentiation 
in Tgif2-KO and Tgif2-Het cultures (n = 3). j, Tgif2-KO cultures were resistant 
to the anti-osteoclastogenic effects of pre-miR-34a (n = 3). i-j, Top, TRAP 
expression; bottom, TRAP staining, osteoclast number (black) and resorptive 
activity (blue). k-I, Tgif2/34a double knockout (DKO) mice were compared 


Pharmacologically, we tested both a treatment protocol using the 
human breast cancer model anda prevention protocol using the mouse 
melanoma model. In both cases, bone metastases were diminished by 
miR-34a-CH (Fig. 3p-s). Systemic miR-34a-CH delivery affected neither 
tumour growth nor metastasis to other organs such as lung (Extended 
Data Fig. 8a, b). Moreover, treating only the cancer cells with miR-34a- 
CH before injection had no effect (Extended Data Fig. 8c, d). Consistent 
with the published finding that miR-34abc deletion does not increase 
tumorigenesis’’, our 34a-KO mice also showed unaltered cancer sus- 
ceptibility (Extended Data Fig. 8e). 

Since systemic miR-34a-CH treatment not only decreases bone resorp- 
tion but also increases bone formation, we examined the effects of miR-34a 
overexpression in osteoblasts. We bred the CAG34a mice with Osterix- 
CreER mice to generate 34a-Osx-Tg mice. Osteoblast differentiation 
was reduced for 34a-KO and 34a-Het mice, but increased for 34a-Osx-Tg 
mice (Extended Data Fig. 9a—d). Consequently, 34a-Osx-Tg mice exhib- 
ited a higher bone formation but unaltered bone resorption, leading to 
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with WT, Tgif2-KO or 34a-KO (2-month-old, male, n = 4). k, Osteoclast 
differentiation. 1, Serum CTX-1. m, Tgif2 mRNA in RAW264.7 cells 

after transfection of transcription factors (n = 3). n, Chromatin 
immunoprecipitation of transcription factor binding and H3K4me3 levels 

at the endogenous Tgif2 promoter in RAW264.7 cells 3 days after RANKL 
treatment (n = 6). 0, Transcription factor was co-transfected into 293 cells with 
its luciferase reporter, together with Tgif2 or a green fluorescent protein control 
(GFP ctrl) (n = 6). p, Luciferase reporter was transfected into WT, Tgif2-KO 
or 34a-KO osteoclast cultures (n = 6). q, r, NFATcl mRNA (q, 1 = 3), c-Jun 
phosphorylation and IkBa degradation (1) in WT, Tgif2-KO or 34a-KO 
osteoclast cultures. Ratios of p-c-Jun/total-c-Jun and IkBo/B-actin are shown. 
s, A model for how miR-34a suppresses osteoclastogenesis. Error bars, s.d. 
*P < 0.05, **P < 0.01, ***P < 0.005, ****P < 0.001; NS, non-significant. 


an increased bone mass (Extended Data Fig. 9e-g). Importantly, how- 
ever, the elevated bone formation alone in the 34a-Osx-Tg mice was 
insufficient to attenuate either OVX-induced bone loss or cancer bone 
metastases (Extended Data Fig. 9h, i). Together, our conditional miR- 
34a transgenic mouse models pinpointed the mechanisms underlying 
the therapeutic benefits of miR-34a by revealing that osteoclast, rather 
than cancer cell or osteoblast, is the critical and essential player. 

To elucidate the mechanisms, we identified Tgif2 as a novel direct 
miR-34a target in the osteoclast lineage (Extended Data Fig. 10a—c). Tgif2 
expression was suppressed by miR-34a gain-of-function, but increased 
by miR-34a loss-of-function, both in mouse and human osteoclast cul- 
tures (Fig. 4a, b and Extended Data Fig. 10d, e). The miR-34a seed region 
in Tgif2 3’ untranslated region (UTR) is evolutionarily conserved in mam- 
mals (Fig. 4c). Luciferase reporter assay showed that Tgif2 3’ UTR is 
sufficient to confer miR-34a regulation (Fig. 4d, e). Importantly, when 
the miR-34a seed region in the Tgif2 3’ UTR was mutated, miR-34a 
regulation was abolished (Fig. 4d, e). 
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Tgif2 expression was increased during WT osteoclast differentiation 
(Fig. 4b). Tgif2-KO and Tgif2-Het mice had lower bone resorption and 
higher bone mass (Fig. 4f-h and Extended Data Fig. 10f). Tgif2 deletion 
reduced osteoclast differentiation, and abolished the anti-osteoclastogenic 
effects of miR-34a (Fig. 4i, j). Moreover, Tgif2/miR-34a double knock- 
out mice (DKO) could no longer increase osteoclast differentiation or 
bone resorption (Fig. 4k, 1) compared with Tgif2-KO mice. These results 
indicate that Tgif2 is pro-osteoclastogenic and essential for miR-34a 
regulation. 

We next investigated how Tgif2 potentiates RANKL signalling. Trans- 
fection assays revealed that NFATc1, c-fos and c-jun, and to a lesser 
extent NF-kB (p65), could induce Tgif2 expression (Fig. 4m). Response 
elements for NFATc1 and AP-1, but not NF-«B, were identified in the 
Tgif2 promoter region (—5 to +5 kb). Chromatin immunoprecipita- 
tion analysis in osteoclast cultures showed that NFAT cl, c-jun and c-fos 
bound to these sites upon RANKL stimulation, leading to activated Tgif2 
transcription shown by the elevated H3K4me3 level at the transcription 
start site (Fig. 4n). This indicates that NFATcl and AP-1 induce Tgif2 
expression during osteoclastogenesis. 

Luciferase reporter assay showed that Tgif2 augmented the activity 
of NFATc1, NF-«B and c-Jun, but not c-fos (Fig. 40). Consistently, the 
activity of endogenous NFATc1, NF-«B and c-Jun, but not c-fos, was 
reduced in Tgif2-KO cultures and enhanced in 34a-KO cultures (Fig. 4p). 
Furthermore, NFATc1 messenger RNA (mRNA), c-Jun phosphoryla- 
tion and IkBo degradation were decreased in Tgif2-KO cultures and 
increased in 34a-KO cultures (Fig. 4q, r). Therefore, Tgif2 potentiates 
osteoclastogenesis via a positive feedback loop in which RANKL-induced 
transcription factors activate Tgif2 expression, and Tgif2 in turn promotes 
their activity. Collectively, these findings reveal Tgif2 as a novel yet cri- 
tical regulator of osteoclastogenesis and bone resorption, as well as a key 
miR-34a direct target that is essential for miR-34a regulation (Fig. 4s). 

The roles of miRNAs in bone physiology have just begun to emerge. 
Recent studies show that osteoblast-specific gain-of-function of miR-34b/c 
decreases bone mass by suppressing osteoblastogenesis and bone 
formation'’’”. Here we show that osteoclast-specific miR-34a gain-of- 
function protects bone by suppressing osteoclastogenesis and bone resorp- 
tion. These findings uncover an interesting functional divergence among 
the miR-34 family members. Our study paves the road for future dis- 
covery of other miRNAs that may be regulated by RANKL and/or con- 
trol Tgif2 expression, as well as future epidemiological and clinical studies 
to explore the pathological and therapeutic roles of this miR-34a-Tgif2 
pathway in human. 

Most systemically delivered drugs can target multiple tissues and cell 
types. We found that miR-34a also enhances bone formation; although 
miR-34a overexpression in osteoblast is neither sufficient nor essential 
for the therapeutic benefits of miR-34a in osteoporosis or bone metastasis, 
miR-34a may bea superior new therapy that exerts both anti-catabolic 
and anabolic effects compared with the current drugs that are solely 
anti-catabolic. Our identification of miR-34a, as well as the recent report 
of miR-141 and miR-219 (ref. 13), opens an exciting avenue for the devel- 
opment of a new generation of RNA-based osteo-protective medicine. 

Other miR-34a target genes have been reported in different biological 
context, such as SIRT (ref. 14), SIRT6 (ref. 15) and PNUTS"*. Although 
miR-34a may also target genes other than Tgif2 in osteoclasts, our genetic 
rescue ex vivo and in vivo shows that Tgif2 is the key miR-34a target, 
suggesting that other genes are probably secondary or functionally irrel- 
evant to osteoclastogenesis. 

miR-34a, miR-34b and miR-34c are commonly deleted in human 
cancers”. In vitro studies suggest that they may be critical mediators of 
p53 function and potential tumour suppressors’®. Surprisingly, in vivo 
studies reveal that miR-34abc triple knockout mice exhibit intact p53 
function without increased tumorigenesis’®. Nonetheless, systemic miR- 
34a administration can indeed attenuate cancer malignancy”. This raises 
the intriguing possibility that its anti-cancer effects may reside in other 
cells that constitute the tumour microenvironment such as the osteoclasts 
in the bone metastatic niche. Indeed, our findings illustrate that bone 
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metastases are effectively blocked by miR-34a in osteoclasts, thus pro- 
viding the first in vivo genetic evidence that miR-34a opposes malig- 
nant progression of cancer by disarming the metastatic niche. 


METHODS SUMMARY 


Conditional miR-34a transgenic mice were generated using the CAG-Z-EGFP vec- 
tor. miR-34a knockout mice were generated using a gene-trap embryonic stem cell 
line. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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The long-term maintenance of a resistance 
polymorphism through diffuse interactions 


Talia L. Karasov’**, Joel M. Kniskern'+*, Liping Gao', Brody J. DeYoung’, Jing Ding't, Ullrich Dubiella*, Ruben O. Lastra’, 
Sumitha Nallu', Fabrice Roux*®, Roger W. Innes’, Luke G. Barrett!+, Richard R. Hudson! & Joy Bergelson! 


Plant resistance (R) genes are a crucial component in plant defence 
against pathogens’. Although R genes often fail to provide durable 
resistance in an agricultural context, they frequently persist as long- 
lived balanced polymorphisms in nature’ *. Standard theory explains 
the maintenance of such polymorphisms through a balance of the 
costs and benefits of resistance and virulence in a tightly coevolving 
host-pathogen pair®®. However, many plant-pathogen interactions 
lack such specificity’. Whether, and how, balanced polymorphisms 
are maintained in diffusely interacting species* is unknown. Here we 
identify a naturally interacting R gene and effector pair in Arabidopsis 
thaliana and its facultative plant pathogen, Pseudomonas syringae. 
The protein encoded by the R gene RPS5 recognizes an AvrPphB 
homologue (AvrPphB2) and exhibits a balanced polymorphism that 
has been maintained for over 2 million years (ref. 3). Consistent with 
the presence of an ancient balanced polymorphism, the R gene confers 
a benefit when plants are infected with P. syringae carrying avrPphB2 
but also incurs a large cost in the absence of infection. RPS5 alleles 
are maintained at intermediate frequencies in populations globally, 
suggesting ubiquitous selection for resistance. However, the presence 
of P. syringae carrying avrPphB is probably insufficient to explain 
the RPS5 polymorphism. First, avrPphB homologues occur at very 
low frequencies in P. syringae populations on A. thaliana. Second, 
AvrPphB only rarely confers a virulence benefit to P. syringae on 
A. thaliana. Instead, we find evidence that selection for RPS5 involves 
multiple non-homologous effectors and multiple pathogen species. 
These results and an associated model suggest that the R gene poly- 
morphism in A. thaliana may not be maintained through a tightly 
coupled interaction involving a single coevolved R gene and effector 
pair. More likely, the stable polymorphism is maintained through 
complex and diffuse community-wide interactions. 

Individuals within a population often exhibit a wide range of suscep- 
tibilities to infection and this variability has a central influence on the 
emergence and spread of disease” '*. The mechanisms that maintain resis- 
tance polymorphisms within populations have therefore been a major 
focus both of theoretical and empirical research'*"*. Despite this effort, 
significant gaps persist in our understanding of how and why genetic 
variation in resistance traits is maintained. For example, current theory 
on the long-term maintenance of resistance polymorphisms assumes that 
a pathogen specializes exclusively on one host species. However, roughly 
half of all plant pathogens associate with multiple hosts’, and many spar- 
sely populated and ephemeral host species, such as A. thaliana, are rarely 
attacked by specialist pathogens. Despite the prevalence of generalist 
pathogens, ancient balanced polymorphisms in resistance are ubiquit- 
ous in A. thaliana’. We currently lack an understanding of how these 
polymorphisms are maintained. 


The interaction between A. thaliana and the model plant pathogen 
P. syringae is an ideal system to disentangle these dynamics both because 
P. syringae is a commonly occurring pathogen in wild populations of 
A. thaliana“ and because of the extensive knowledge of the molecular 
biology of this interaction’. P. syringae interacts with A. thaliana through 
the secretion of virulence-associated proteins (effectors) that downreg- 
ulate plant basal defence’. A. thaliana, in turn, employs an arsenal of 
R gene products that recognize the action of specific effector proteins, 
inducing localized cell death (hypersensitive response) and a systemic 
production of chemical defences. These interactions fit the classic gene- 
for-gene model in requiring a close match between an effector and the 
corresponding R gene allele for a resistance response. Although P. syringae 
does not specialize on A. thaliana’, multiple R genes in A. thaliana 
recognize specific P. syringae effectors and exhibit balanced resistance 
polymorphisms that have been maintained for millions of years**. 
Here we examine the dynamics of one specific gene-for-gene interaction 
in co-occurring A. thaliana and P. syringae populations to identify ecol- 
ogical and evolutionary processes underlying the long-term mainten- 
ance of resistance polymorphisms. 

We first investigated the genetic basis of recognition in the interac- 
tion between A. thaliana and one of its resident P. syringae pathogens. 
P. syringae strain PNA29.1a was isolated from a natural A. thaliana 
population“ and shown to induce variable hypersensitive response on 
different A. thaliana accessions. To identify the effector eliciting hyper- 
sensitive response, we expressed cosmid clones of the PNA29.1a genome 
ina P. syringae strain, PstDC3000 (DC3000), which does not elicit hyper- 
sensitive response in A. thaliana. A genomic fragment conferring induction 
of hypersensitive response to DC3000 was found to contain a homo- 
logue of the P. syringae effector avrPphB (also known as hopARI and 
avrPph3). Through subcloning, we confirmed that this effector, which 
we name AvrPphB2, is required for recognition. To identify the corre- 
sponding R gene, we mapped the hypersensitive response phenotype 
induced by DC3000(AvrPphB2) on an association panel of 75 A. thaliana 
accessions’° (Supplementary Table 1) and observed a single significant 
peak on chromosome 1 (Fig. 1a). The three most significant single nucle- 
otide polymorphisms (P = 1.27 X 10~ 2) fell within 3.5 kilobases (kb) of 
RPS5, which encodes an R protein known to recognize AvrPphB”’. To 
confirm that RPS5 is responsible for recognition of AvrPphB2, we gen- 
erated six isogenic pairs of resistant (RPS5*) and susceptible (RPS5_) 
plant lines and tested whether an AvrPphB2 induced hypersensitive 
response is restricted to the RPS5* hosts. All RPS5* lines expressed hyper- 
sensitive response in response to PNA29.1a after 16-24 h, whereas their 
RPS5_ isolines did not. As expected, growth of DC3000(avrPphB2) was 
significantly reduced in RPS5" plants relative to RPS5_ plants, even after 
correcting for multiple tests (P = 1.45 x 10° 3, Wilcoxon rank-sum test) 
(Extended Data Fig. 1). 
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Figure 1 | Identification of RPS5 and AvrPphB2 as a naturally interacting 
R-gene-effector pair in A. thaliana and P. syringae populations. 

a, Hypersensitive response in response to infection by DC3000(avrPphB2) was 
scored in 75 worldwide A. thaliana accessions that were previously genotyped 
at ~250,000 single nucleotide polymorphisms’*. EMMAX” was used to 
perform genome-wide association mapping of differential hypersensitive 
response. The Manhattan plot illustrates the P values associated with each of the 
single nucleotide polymorphisms, and the dotted line signifies the Bonferroni 
correction threshold for significance (P = 2.33 X 10~’). The top three single 
nucleotide polymorphisms, each with P values 1.27 X 10~ ?2 lie within 3.5 kb of 
RPS5 on chromosome (Chr) 1. b, AvrPphB2 cleaves PBS1. Immunoblot of 
PBS1 tagged with haemagglutinin (PBS1-HA) expressed alone or co-expressed 
with AvrPphB, AvrPphB C98S (non-active mutant) or AvrPphB2-myc in 
Nicotiana benthamiana. AvrPphB and AvrPphB2 both cleave the full-length 
(FL) PBS1 whereas AvrPphB C988 does not. The upper panel shows the 
immunoblotting results for the HA-tagged PBS1 and the bottom panel shows 
the results for immunoblotting for AvrPphB variants. 


AvrPphB2 exhibits relatively low identity with AvrPphB at both nucle- 
otide and protein levels (75% and 78% respectively). This divergence 
prompted us to ask whether AvrPphB2 triggers host resistance ina man- 
ner similar to AvrPphB by testing whether AvrPphB2 also cleaves the 
host protein PBS1, a virulence target of AvrPphB’®. An immunoblot assay 
demonstrated cleavage of PBS1 by AvrPphB2 (Fig. 1b). Furthermore, 
infection of RPS5* /PBS1~ plants with DC3000(avrPphB2) failed to elicit 
the hypersensitive response, confirming a conserved mode of action for 
PBS1-dependent RPS5 recognition of AvrPphB2. Thus, we find no evi- 
dence that sequence divergence in AvrPphB2 is associated with func- 
tional divergence in cleavage of PBS1. 

RPS5 was previously shown to exhibit an ancient balanced polymor- 
phism for its presence or absence in A. thaliana’. We confirmed this find- 
ing through a comparison of sequence divergence at the intergenic region 
surrounding the RPSS locus and here estimate that the polymorphism 
has been maintained in A. thaliana for approximately 2.6 million years 
(with 95% confidence intervals of 1.5-3.9 million years (refs 19, 20)). 
As shown previously’, the divergences between the Arabidopsis lyrata 
RPS5 locus and the A. thaliana presence and absence alleles are equi- 
valent, indicating that the deletion event did not skew the inference of 
polymorphism age. Furthermore, we extended this work to examine poly- 
morphisms within a broad array of A. thaliana populations worldwide 
using a PCR assay for RPS5 presence/absence. The survey of 1,198 genet- 
ically distinct plants from 357 populations revealed that RPS5 was pres- 
ent in 51% of individuals (Fig. 2). Within the 39 populations in which we 
detected four or more distinct genotypes, RPS5 was found at an average 
frequency of 56% (Fig. 2a), with both alleles present in more than 90% 
of these populations. Thus, RPS5 polymorphism is long-lived, common 
in extant populations and maintained at intermediate frequency in many 
local populations. 

This ongoing maintenance ofan ancient R gene polymorphism within 
populations suggests strongly that RPS5 confers both a fitness advantage 
and a fitness cost”', probably dependent upon the presence or absence 
of specific pathogen effector loci. To test for a cost of resistance in the 
absence of infection, we measured the relative fitness of RPS5*/RPS5~ 
isolines in a field experiment performed in the absence of avrPphB homo- 
logues, including avrPphB2. Our experimental design included six pairs 
of RPS5*'~ isolines, with replicates of each pair randomized within blocks 
ina field in Downer’s Grove, Illinois, USA. Importantly, we used RPS5*/~ 
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Figure 2 | The distribution of RPS5 and avrPphB homologues in 
co-occurring A. thaliana and P. syringae populations. PCR was used to test 
for the frequency of RPS5 in A. thaliana populations across Europe (a) and the 
Midwestern USA (Extended Data Fig. 3). The RPS5 locus was polymorphic 
in more than 90% (36 out of 39) of these populations (one population in eastern 
Asia is not illustrated). Dot-blot assays tested for the presence of AvrPphB 
homologues in isolates of P. syringae from the Midwestern USA (b) and from 
France (c). avrPphB homologues are found at 3.27% frequency in the 
Midwestern USA (RPS5 frequency ~ 24%) and at 2.29% frequency in France 
(RPSS5 frequency ~ 81%). 


isolines generated both in susceptible and resistant backgrounds (via 
Cre-lox recombination and EMS mutagenesis respectively) to allow 
assessment of fitness effects due to autoimmunity”. For the isogenic 
RPS5*’~ pairs in the susceptible and resistant backgrounds respectively, 
the RPS5* genotype produced fewer seeds (P = 0.002, P < 0.001), fewer 
siliques (P = 0.002, P < 0.001) and less plant biomass (P < 0.001 both) 
thanthe RPSS genotype (paired t-test, n = 214 in the susceptible back- 
ground and n = 478 in the resistant background). The fitness cost asso- 
ciated with the RPS5™ allele ranged from 5 to 10.2% (Fig. 3). 

The large fitness cost associated with the presence of RPS5 suggests 
that a high intensity of pathogen-mediated selection is required for the 
resistance polymorphism to be maintained. Consistent with this expec- 
tation, a fitness benefit of roughly 20% has been reported for plants car- 
rying RPS5 when infected with P. syringae containing avrPphB2”. In 
contrast, a survey examining the prevalence of avrPphB and its homo- 
logues in P. syringae isolates from A. thaliana plants in the Midwestern 
USA revealed that avrPphB homologues were restricted to 4 out of 11 
populations tested, and maintained at very low frequency (6 of 183 iso- 
lates (3.27%); Fig. 2b), despite RPS5 being found at intermediate fre- 
quencies in this region (24% of plants in US populations; Extended 
Data Fig. 2). A. thaliana was introduced relatively recently into North 
America, and the ancient balanced polymorphism predates this intro- 
duction. To determine whether frequencies of avrPphB homologues are 
greater in Europe where it is naturalized, we tested for the presence of 
AvrPphB homologues in 131 isolates from the extended P. syringae spe- 
cies complex’’ from 19 A. thaliana populations across France (Fig. 2c). 
We found putative distant homologues in three (2.29%) of these isolates. 
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Figure 3 | The cost of RPS5-mediated resistance in the absence of AvrPphB 
homologues. Field trials of paired isogenic lines of A. thaliana that differed 
only in the presence or absence of RPSS driven by its natural promoter. Lines C1 
and C2 were generated by the insertion of RPS5 and its flanking regions into 
the susceptible Ga-0 background and subsequent excision of RPS5 using a 
Cre-lox system. Lines E1-E4 were generated by random point mutation in the 
native RPS5 gene to generate a null allele in the resistant Col-0 background 
(described in Supplementary Information). In all pairings, RPS5* plants 
exhibited reduced fitness relative to RPS5 plants, ranging in magnitude from 
5.0 to 10.2%. The percentage indicates the percentage decrease in seed 
production in the RPS5* line relative to the RPS5 line, and the stars denote 
P<0.05 ina paired t-test (with 51 and 56 plant pairings in the two susceptible 


backgrounds, and 59, 52, 66 and 62 plant pairings in the resistant backgrounds). 
Results are presented as the mean + one s.e.m. 
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Despite the rarity of AvrPphB homologues, RPS5* iscommon in French 
populations (81% of plants in French A. thaliana populations possess 
RPS5). These results again suggest that a pairwise interaction between 
A. thaliana and P. syringae carrying homologues of AvrPphB may be 
insufficient for the maintenance of RPS5. 

Current models to explain the maintenance of polymorphism in antag- 
onistic species interactions typically assume tightly coupled interactions 
between specific host and enemy genotypes”*®. However, single R genes 
may recognize and interact with multiple pathogen effectors’™*. We 
hypothesized that the high frequency of RPS5 within A. thaliana popula- 
tions may result from selection imposed by effectors other than AvrPphB 
homologues. We tested this idea by examining differential hypersensi- 
tive responses between RPS5‘’~ isogenic plants when infected with 42 
A. thaliana-isolated P. syringae strains from the Midwestern USA. These 
strains were chosen to lack avrPphB and its homologues on the basis of 
dot-blot hybridizations. One strain differentially induced hypersensi- 
tive response on RPS5" plants, suggesting that RPS5 may encode a protein 
that has the capacity to recognize non-homologous effectors (Supplemen- 
tary Table 2). However, the low frequency of P. syringae isolates with any 
effector recognized by RPSS5 in these 42 isolates (<5%) suggests addi- 
tional sources of selection for resistance. 

A. thaliana is host toa diverse bacterial community within its leaves”. 
A selective advantage for RPS5 could result if multiple A. thaliana path- 
ogens carry AvrPphB homologues or non-homologous effectors recog- 
nized by RPS5. In support of the former, sequence analysis suggests the 
potential for horizontal transfer of avrPphB among bacterial species. A 
comparison of the genomes of a P. syringae kiwi crop pathovar and an 
A. thaliana pathovar revealed that synonymous divergence between 
avrPphB and avrPphB2is substantially greater than divergence between 
other shared homologues (synonymous divergence = 2.28 versus an aver- 
age synonymous divergence of 0.60; empirical P = 0.003) (Extended Data 
Fig. 3). The unusually ancient coalescence between avrPphB homologues 
raises the possibility that at least one AvrPphB homologue was horizon- 
tally transferred from another bacterial species. If AvrPphB homolo- 
gues persist in another species, interactions between A. thaliana and 
this other species could contribute to selection for RPSS5. 
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Spillover of pathogens from alternative neighbouring hosts has been 
shown to influence resistance traits in plant populations’. In the agricul- 
tural and disturbed ecosystems where A. thaliana is commonly found, 
Arabidopsis exists as a small ephemeral that is a less predictable and pro- 
ductive host than co-occurring perennials and crops. Other hosts can 
provide strong selection on the virulence of P. syringae; indeed, isolates 
of P. syringae collected from other hosts consistently growin A. thaliana 
where they can demonstrate unusually high virulence”. Infrequent infec- 
tion by highly virulent strains of P. syringae could impose strong selection 
for the ability to recognize these pathovars. It is unknown, however, if 
RPSS recognizes effectors in a wide breadth of pathovars. 

To determine whether avrPphB and its homologues are present in 
strains isolated from alternative hosts, we probed a panel of 78 P. syringae 
strains isolated from crops. Successful hybridization to an avrPphB probe 
was observed for 21 out of 78 (27%) of the pathovar strains (Supplemen- 
tary Table 3). We cloned 14 of these homologues, expressed them in 
DC3000 and tested for RPS5-mediated recognition. These homologues 
were chosen to span diverse P. syringae clades and thus provide a rep- 
resentative sample of how A. thaliana and P. syringae interact at the RPS5/ 
avrPphB interface, at least in an agricultural context. Of course, selection 
imposed by crop pathovars cannot explain a polymorphism that pre- 
dates the advent of agriculture. Nevertheless, these pathovars provide a 
window into how effectors carried by non-resident strains interact with 
A. thaliana. Approximately 70% (10 out of 14) of these homologues were 
recognized, indicating that non-resident P. syringae can be subject to 
RPS5-mediated resistance (Extended Data Fig. 4). Although the frequency 
of pathogenic spillover onto A. thaliana populations is not known, this 
result demonstrates that pathovars of P. syringae have the potential to 
impose selection favouring the presence of RPS5 in A. thaliana populations. 

The low frequency of avrPphB in P. syringae residing within the leaves 
of A. thaliana suggests a limited fitness advantage for carrying this gene. 
We compared growth of DC3000 carrying an empty plasmid with the 
growth of DC3000(avrPphB2) (Extended Data Fig. 5), and found that 
AvrPphB2? significantly enhanced growth within an RPS5 accession (Wil- 
coxon rank-sum test, P = 0.04). The common laboratory strain DC3000 
is a crop pathovar highly diverged from most A. thaliana isolates of 
P. syringae’’; consequently, the virulence effect of AvrPphB2 may differ in 
the endemic P. syringae isolates. Similar experiments using each of three 
P. syringae isolates collected from A. thaliana populations revealed a 
virulence benefit in only one of the three isolates (Extended Data Fig. 6) 
(Wilcoxon rank-sum tests; P = 0.01, 0.90 and 0.50 respectively). Virulence 
benefits that are dependent on the genetic background help explain the 
relatively low frequency of avrPphB2 within A. thaliana populations. 

These observations are not consistent with the tight R gene and effector 
pairing thought necessary to generate stable resistance polymorphisms*”. 
Our data instead suggest that selection favouring RPSS is diffuse, deriv- 
ing from multiple and perhaps individually small selective agents. Sim- 
ilarly, the frequency of the effector in the pathogen may be only weakly 
coupled to the frequency of RPSS in A. thaliana. This is particularly true 
if P. syringae isolates intermittently and transiently spill over from alter- 
native hosts to infect A. thaliana populations. This inconsistency between 
longstanding theory and our observations raises the possibility that dif- 
fuse interactions between A. thaliana and P. syringae may generate stable 
balanced polymorphisms. 

To test this possibility, we modelled the dynamics of a resistance 
polymorphism under a range of parameters for a diffusely interacting 
host-pathogen pair. More specifically, we asked whether a balanced poly- 
morphism could be maintained when the frequency of an R gene poly- 
morphism depends upon the frequency of recognized effectors, but the 
frequency of effectors is uncoupled from R gene dynamics. To determine 
the conditions under which the polymorphism is stably maintained 
(stability is defined here as maintenance = 100,000 generations), we 
modelled the dynamics of an R gene polymorphism across a range of 
values for the cost and the frequency of infection (Fig. 4). Next, we incor- 
porated frequency dependence as a mechanism to couple effector and 
R gene dynamics asymmetrically without explicitly modelling effector 
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Figure 4 | The maintenance of a balanced polymorphism in a diffuse 
interaction. To ascertain the conditions sufficient to maintain an R gene 
polymorphism stably in a diffuse interaction, we modelled the dynamics of an 
R gene polymorphism with a range of parameters for the cost of infection 

(y axis) and probability of infection with bacteria containing a recognized 
effector (x axis). With a starting frequency of 0.5 for the R gene and a cost of 
resistance of 0.08 (a value consistent with our observations in Fig. 3), we 
recursively simulated the frequency of the R gene to determine the number of 
generations that the R gene remained at a putatively detectable frequency 
(between 0.01 and 0.99). The simulation was run for 100,000 generations. 
The colour bar signifies the number of generations, up to 100,000, that the 
polymorphism is maintained. For a, the dynamics were modelled without 
frequency dependence. The R gene polymorphism is stably maintained for a 
natrow range of parameters. With the incorporation of negative frequency 
dependence into the model (described in Supplementary Information), as 
illustrated in b, the parameter space allowing a stable polymorphism expands. 


evolution (reviewed in ref. 12). We envision a situation in which infec- 
tion of susceptible plants increases resource availability for uninfected 
plants owing to a population-wide reduction in plant biomass or num- 
bers. The addition of frequency dependence substantially expands the 
parameter space over which the R gene polymorphism is maintained 
(Fig. 4b). Across a range of infection rates (20-99%), and for relatively 
low costs of infection (1-30%), the balanced polymorphism can be main- 
tained for thousands of generations (Fig. 4b and Extended Data Fig. 7). 
Thus, the maintenance of resistance polymorphisms does not require a 
tight coevolutionary pairing. 

Even though it is unlikely that many A. thaliana-pathogen interac- 
tions are tightly coupled (but see ref. 28), A. thaliana populations exhibit 
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widespread balancing selection at R gene loci’. Little is known of the 
ecological factors that drive R gene evolution in A. thaliana, and recent 
studies suggest that selection on resistance is both pathogen and host 
dependent. Our results point to the maintenance of one of these ancient 
balanced polymorphisms through complex interactions with multiple 
hosts sharing P. syringae, and through RPS5 interacting with multiple 
effectors distributed across multiple pathogen species. Recent studies 
revealing R-gene-mediated sexual incompatibilities and divergent selec- 
tion for resistance to necrotrophs versus biotrophs suggest that numer- 
ous additional interactions may select for variation at resistance loci””””. 
Our corresponding model demonstrates that a stable polymorphism 
does not require the tight coevolutionary coupling of a host and patho- 
gen, but rather can be maintained in the face of complex interactions such 
as the ones we observe. Many, if not most, pathogens persist in complex 
ecological communities, with multiple hosts and competitors’. Future 
work on host-pathogen coevolution should elucidate whether this com- 
plexity is a general mechanism for the maintenance of variation in sus- 
ceptibility and resistance. 


METHODS SUMMARY 


A genomic library of PNA29.1a was created using the broad-host-range vector pLAFR1, 
and the genomic fragment containing avrPphB2 expressed in PstDC3000 was shown 
to be sufficient to induce hypersensitive response in Col-0 plants. The genetic basis 
of differential hypersensitive response in 75 A. thaliana ecotypes in response to infec- 
tion with PstDC3000(avrPphB2) was mapped'* using EMMAX”°. The frequency of 
RPS5 was determined by a PCR assay on genetically distinct A. thaliana lines. The 
fitness effects of the RPS5 locus for A. thaliana were tested through comparisons of 
six paired isogenic lines. Two of the pairs were derived from Cre-lox site-specific 
recombination involving a fragment of the Col-0 ecotype containing the RPS5 cod- 
ing region, and the promoter and terminator regions in the Ga-0 ecotype. The remain- 
ing four pairs of isogenic lines within the Col-0 background were backcrosses of an 
RPS5 mutant containing an early stop codon at amino-acid position 319 generated 
in the Arabidopsis TILLING project. To measure the fitness effects of AvrPphB2, 
the growth ofa P. syringae strain transformed with an empty pME6010 plasmid was 
compared with strains transformed with pME6010 containing avrPphB2. avrPphB 
homologues were amplified and cloned into a modified version of the Gateway- 
compatible vector pMTN41, then expressed in DC3000. RPS5*’~ plants were infected 
with DC3000 carrying the different homologues to detect differential hypersensi- 
tive response. The dynamics of the R gene polymorphism over time were modelled 
with an infinite population size and a cost of resistance of 8%. For frequency-dependent 
selection, the fitness of a resistant individual was modelled as increasing with the 
frequency of susceptible individuals. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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The Get1/2 transmembrane complex is an 
endoplasmic-reticulum membrane protein insertase 


Fei Wang’, Charlene Chan’, Nicholas R. Weir! & Vladimir Denic! 


Hundreds of tail-anchored proteins, including soluble N-ethylmaleimide- 
sensitive factor attachment receptors (SNAREs) involved in vesicle 
fusion, are inserted post-translationally into the endoplasmic retic- 
ulum membrane by a dedicated protein-targeting pathway’ *. Before 
insertion, the carboxy-terminal transmembrane domains of tail- 
anchored proteins are shielded in the cytosol by the conserved targeting 
factor Get3 (in yeast; TRC40 in mammals)*’. The Get3 endoplasmic- 
reticulum receptor comprises the cytosolic domains of the Get1/2 
(WRB/CAML) transmembrane complex, which interact individu- 
ally with the targeting factor to drive a conformational change that 
enables substrate release and, as a consequence, insertion® "’. Because 
tail-anchored protein insertion is not associated with significant trans- 
location of hydrophilic protein sequences across the membrane, it 
remains possible that Get1/2 cytosolic domains are sufficient to place 
Get3 in proximity with the endoplasmic-reticulum lipid bilayer and 
permit spontaneous insertion to occur'””’. Here we use cell reporters 
and biochemical reconstitution to define mutations in the Get1/2 
transmembrane domain that disrupt tail-anchored protein insertion 
without interfering with Get1/2 cytosolic domain function. These 
mutations reveal a novel Get1/2 insertase function, in the absence of 
which substrates stay bound to Get3 despite their proximity to the lipid 
bilayer; as a consequence, the notion of spontaneous transmembrane 
domain insertion is a non sequitur. Instead, the Get1/2 transmem- 
brane domain helps to release substrates from Get3 by capturing their 
transmembrane domains, and these transmembrane interactions 
define a bona fide pre-integrated intermediate along a facilitated 
route for tail-anchor entry into the lipid bilayer. Our work sheds light 
on the fundamental point of convergence between co-translational 
and post-translational endoplasmic-reticulum membrane protein tar- 
geting and insertion: a mechanism for reducing the ability of a tar- 
geting factor to shield its substrates enables substrate handover toa 
transmembrane-domain-docking site embedded in the endoplasmic- 
reticulum membrane. 

We have previously found that elution of substrates from Get3 immo- 
bilized on a resin can be achieved in the absence of any membranes by 
the addition of an engineered heterodimer of Get1/2 cytosolic domains 
(miniGet1/2)*. At physiological protein concentrations, mini-Get1/2 en- 
abled substrate elution in a manner that was dependent on the inter- 
actions of both Get1/2 cytosolic domains with Get3. Notably, substrate 
elution by miniGet1/2 was also dependent on the presence of an engi- 
neered tail-anchored trap derived from Sgt2, a transmembrane domain 
(TMD)-recognition factor that delivers newly synthesized tail-anchored 
proteins to Get3 (ref. 14). By chemical crosslinking between Get3 and 
substrate, we have subsequently found that the tail-anchored trap pre- 
vents apparent re-binding of substrates to Get3 (Extended Data Fig. 1a). 
Thus, in the simplest model for insertion, the only role of the Get1/2 
TMD is to physically link the Get1/2 cytosolic domains so that they can 
work together to enable ‘trapping’ of substrate tail anchors by the nearby 
hydrophobic lipid bilayer. A more complex alternative to this sponta- 
neous insertion model is that the Get1/2 TMD is an insertase that facil- 
itates entry of substrate tail anchors into the lipid bilayer. 


The spontaneous insertion model predicts that the insertion of Get3 
substrates should be insensitive to genetic perturbations of the Get1/2 
TMD, which mediates complex formation, as long as the function of the 
Get1/2 cytosolic domains is preserved. To avoid the potential for com- 
plex disruption by mutations in the six transmembrane segments (Get1 
TMI1-3 and Get2 TM1-3), we first engineered a single-chain version 
of the Get1/2 heterodimer (Get2-1sc) expressed from the endogenous 
GET2 promoter in Aget] cells. The resulting protein fusion was func- 
tional (Extended Data Fig. 1b and Fig. 1a), as measured using a green 
fluorescent protein (GFP) cell reporter of heat-shock factor transcrip- 
tional activity’, which is a good monitor of tail-anchored protein aggre- 
gation in the cytosol due to compromised Get1/2 function®. Get1/2 
transmembrane segments were replaced with transmembrane segments 
from unrelated endoplasmic reticulum (ER) membrane proteins, either 
Sec61-B or Ost4 (Extended Data Fig. 2a). In addition, we mutated an 
absolutely conserved aspartic acid residue near the middle of Get2 TM3 
(D271K) because replacement of this transmembrane segment severely 
destabilized Get2-1sc (Extended Data Fig. 2b and data not shown). All 
the mutations in the Get1/2 TMD resulted in the loss of Get2-1sc func- 
tion, as evidenced by elevated heat-shock factor activity, with some alleles 
resulting in more apparent heat shock than others (Fig. 1a). 
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Figure 1 | In vivo and in vitro analysis of loss-of-function mutations in the 
Get1/2 TMD. a, The amount of GFP produced by the heat-shock reporter 
(Extended Data Fig. 1b) in the indicated strains was determined by flow 
cytometry analysis and plotted as the average of two independent experiments 
(n = 2). b, Affinity-purified Get3-Sec22 was incubated with the indicated 
microsomes at room temperature. Samples were resolved by SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) and visualized by 
autoradiography. The positions of unglycosylated and glycosylated Sec22 
(gSec22) are indicated. Insertion efficiency is expressed as the percentage of 
Sec22 that is glycosylated. c, Affinity-purified Get3-Sec22 was incubated with 
the indicated microsomes or mock-incubated at room temperature followed 
by crosslinking with disuccinimidyl suberate (amine-reactive crosslinker). 
Samples were subjected to SDS-PAGE and visualized by autoradiography. The 
crosslinked product between Get3 and Sec22 is indicated (Get3 X Sec22). 
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To measure the impact of TMD mutations on Get1/2 activity more 
directly, we first produced radiolabelled Sec22 (a SNARE tail-anchored 
protein that facilitates vesicle fusion in the early secretory system) by 
in vitro translation in a wild-type budding yeast cell extract. Next, we 
affinity-purified Get3-Sec22 and monitored insertion into ER-derived 
membranes (microsomes) by glycosylation at a carboxy (C)-terminal 
glycan attachment site. We observed that loss-of-function mutations in 
the Get1/2 TMD also caused a proportional decrease in Sec22 insertion 
(Fig. 1b and Extended Data Fig. 3a). Chemical crosslinking between Get3 
and Sec22 revealed that reduced Sec22 insertion was due to reduced 
substrate release from Get3 (Fig. 1c and Extended Data Fig. 3b). Even 
though Sec22 has a relatively hydrophobic TMD that might hinder its 
release from Get3, another tail-anchored protein, Sec61-B, which has 
a significantly less hydrophobic TMD, was similarly compromised by 
genetic disruptions of the Get1/2 TMD (Extended Data Fig. 3c). 

Loss of Get1/2 TMD function would be expected to cause a defect 
in Get3 substrate release if the ER bilayer were by itself not an efficient 
‘trap’ for TMDs delivered into its proximity by Get3 interactions with 
Get1/2 cytosolic domains. We performed two stringent tests of this idea 
by focusing on our strongest loss of function allele, Get2y2m,-1sc, which 
did not apparently disrupt Get2-1sc ER protein targeting and mem- 
brane topology (Supplementary Discussion and Extended Data Figs 4 
and 5). First, we took advantage of our previous observation that Get3 
interacts more avidly with miniGet1/2 than with either miniGet1©°"/2 
or miniGet1/2©2™, which contain point mutations in Getl and Get2 
cytosolic domains that abolish interactions with Get3 (ref. 8). This ex- 
plains why Get3-Sec22 association with microsomes during a flotation 
assay depends on its interactions with both Get1/2 cytosolic domains*: 
avidity is necessary for targeting complexes to remain membrane-bound 
in the absence of Get3 re-binding during the 3 h in which microsomes 
float to the top of a density gradient. Consistent with our starting hy- 
pothesis, we found that Get3-Sec22 targeting complexes maintained a 
better association with Get2y42,,-Get1sc microsomes relative to con- 
trol Get2?™/Get1 microsomes (Fig. 2a). As a second test, we adapted 
our tail-anchored trap system to work on the surface of membranes by 
preparing proteoliposomes with three ingredients: a phospholipid mix- 
ture that approximated the composition of the ER membrane, trace 
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amounts of a Ni-NTA (nitrilotriacetic acid) phospholipid for subse- 
quent tethering of the His-tagged tail-anchored trap, and Get2;y42m- 
Getlsc that we affinity purified from yeast in complex with endogenous 
Get3 (Fig. 2b and Extended Data Fig. 6a). In further support of our hy- 
pothesis, we observed that the tail-anchored trap increased substrate 
release from Get2-,42m-Get1sc proteoliposomes (Fig. 2b). Importantly, 
the tail-anchored trap was not able to rescue the substrate release defect 
of Get2/Get1©?™ proteoliposomes (Fig. 2b and Extended Data Fig. 6b). 
In summary, our data demonstrate that the ER bilayer is by itself not an 
efficient ‘trap’ for tail-anchored proteins delivered to its proximity by 
Get3 interactions with the cytosolic domains of Get1/2. They also argue 
that the Get1/2 complex has a distinct insertase function that resides in 
its TMD and that can be genetically separated from the function of its 
cytosolic domains. 

We hypothesized that Get1/2 interacts with substrate TMDs to cou- 
ple the release of tail-anchored proteins from Get3 to the subsequent 
membrane insertion step. To test this idea, we needed a means to sta- 
bilize potential Get1/2-substrate interactions during the rapid insertion 
process. Our solution was to fuse an S-tag to the C terminus of Sec22 
and add S protein, which binds to the S-tag to form a stable tertiary 
structure’®: S protein selectively blocked insertion of full-length Sec22 
with a C-terminal S peptide (Sec22s) (Fig. 3a) without compromising 
the ability of Get1/2 to drive release of the road-blocked substrate from 
Get3 (Fig. 3b). To establish if road-blocked Sec22s forms a defined 
complex with Get1/2, we monitored by chemical crosslinking the prox- 
imity of an engineered single cysteine in the Sec22 TMD (S192C) and 
single cysteines at a variety of Get1/2 transmembrane positions that do 
not perturb Get1/2 function in vivo (Extended Data Fig. 7a). When we 
used bis-maleimidohexane (BMH), which irreversibly crosslinks react- 
ive thiols that are within 13 A of one another, we detected crosslinks 
between Sec22 and several positions that lie along the same face of the 
Get] TM1 o-helix (Fig. 3c and Extended Data Fig. 7b). We also detected 
several crosslinks between Sec22 and positions on Get! TM2 and TM3, 
as well as a substrate-crosslinking position on Get2 TM3 (Extended Data 
Fig. 7c). Our data suggest that the road-blocked Sec22 interacts with a 
composite TMD-docking site that resides near the cytosolic side of the 
ER membrane, comprises transmembrane regions from both Get1 and 


Figure 2 | Get1/2 has a novel insertase function 
that resides in its TMD. a, Affinity-purified 
Get3—Flag-Sec22 was incubated with the indicated 
microsomes at room temperature. Samples were 
overlaid with an Optiprep gradient and subjected 
to ultracentrifugation. Proteins precipitated 

from the four indicated gradient fractions were 
resolved by SDS-PAGE and visualized by 
autoradiography and immunoblotting. Non- 
glycosylated Sec22 that associated with GET2- 
IFLAGsc, GET2pyy2n-1FLAGsc and GET2-°"" 
GETIFLAG microsomes corresponded to 11%, 
10% and 2% of the summed Sec22 signal in all the 
fractions, respectively. b, Ni-NTA proteoliposomes 
reconstituted with the indicated mutant Get1/2 
transmembrane complexes were incubated with 
tail-anchored (TA) trap or mock-incubated 
followed by crosslinking analysis as in Fig. Ic. 
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Figure 3 | The Get1/2 TMD binds a road-blocked tail-anchored protein 
released from Get3. a, Top: schematic showing attachment of S protein to the 
S-tag at the C terminus of a tail-anchored protein bound to Get3. Bottom: 
affinity-purified Get3-Sec22 or Get3-Sec22 (with S-tag) were incubated with S 
protein at room temperature or mock-incubated followed by Sec22 insertion 
analysis with wild-type microsomes as in Fig. 1b. b, Affinity-purified Get3- 
Sec22s was road-blocked with S protein and incubated with the indicated 
microsomes or mock-incubated at room temperature followed by crosslinking 
analysis as in Fig. Ic. ¢, Left: affinity-purified Get3-Sec22s192cs was road- 
blocked with S protein and incubated with the indicated microsomes or mock- 
incubated at room temperature followed by BMH crosslinking on ice. 
Samples were subjected to SDS-PAGE and visualized by autoradiography. 
Uncrosslinked Sec22s and an approximately 60 kDa crosslink product between 
Get1 and Sec22s are indicated. Right: schematic showing the membrane 
topology of Get1 with crosslinking positions on TM1 indicated. 
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Get2, and is aqueous in nature, since BMH only reacts with thiols ex- 
posed to an aqueous environment”. 

Three lines of evidence argue that by road-blocking Sec22 we have 
stabilized a bona fide on-pathway (pre-integrated) intermediate. First, 
we detected no TMD crosslinking to an insertase-defective Get1/2 (Ex- 
tended Data Fig. 8a, b). Second, as would be predicted for an arrested 
on-pathway intermediate, addition of S peptide after intermediate ac- 
cumulation efficiently relieved arrest and resulted in efficient ‘chasing’ 
to the inserted product (Fig. 4a). Lastly, when we monitored the kin- 
etics of Sec22 TMD crosslinking to two positions on Get] TM1 during 
unimpeded integration, we observed the expected rapid and transient 
intermediate formation (Fig. 4b and Extended Data Fig. 8c). Impor- 
tantly, Sbh1 (yeast Sec61-B homologue) followed a similar integration 
pathway to Sec22 (Extended Data Fig. 8d, e), arguing that we have un- 
covered a general mechanism for insertion of tail-anchored proteins 
into the ER. 

Our work reveals that the Get1/2 transmembrane complex is an inser- 
tase that, compared with the Sec61 protein translocon in the ER mem- 
brane, provides an essential mechanistic insight into membrane protein 
insertion in general. Specifically, in the case of co-translational ER mem- 
brane protein insertion, the precise mechanism by which the signal rec- 
ognition particle hands over its substrate to a binding site located within 
the Sec61 channel is not known but it involves GTP hydrolysis of the 
signal recognition particle and competitive binding between Sec61 and 
the signal recognition particle to their overlapping sites near the ribo- 
some exit tunnel'*””. In the case of post-translational tail-anchored pro- 
tein insertion into the ER, we have shown that Get1/2 cytosolic domains 
collaborate with the Get1/2 TMD to enable efficient substrate release 
from Get3. Speculatively, we suggest that substrate handover between 
Get3 and Get1/2 creates a facilitated pathway that overcomes the kin- 
etic energy barrier between substrate release from the open conforma- 
tion of Get3 and the thermodynamically favourable substrate state in 
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Figure 4 | Tail-anchored protein docking to the Get1/2 TMD is en route a 
facilitated integration pathway. a, Left: schematic illustrating two extreme 
outcomes of the S peptide ‘chase’ experiment. Right: affinity-purified 
Get3-Sec22s was split four-ways and subjected to the three stages of the 
experiment, as indicated, followed by Sec22s insertion analysis with wild-type 
microsomes as in Fig. 1b. b, Affinity-purified Get3-Sec22s)92cs was incubated 
with Getl,}gc microsomes at room temperature for the indicated amounts 
of time. Samples were simultaneously subjected to Sec22 insertion and 
crosslinking analysis as in Figs 1b and 3c, respectively. On the right is a kinetic 
analysis plot of the gel data shown on the left. XL refers to the crosslinked 
product between Get1 and Sec22; INS, Sec22s)92c inserted. c, Working model 


Reaction coordinate 


for the Get1/2 insertase mechanism. The cytosolic domains work together with 
a membrane-embedded TMD-docking site to afford a coupled mechanism 
that facilitates substrate release from Get3 and TMD partitioning into the 
phospholipid bilayer. We speculate that substrate binding to the TMD-docking 
site reduces the activation energy barrier (dotted line) that is apparently 
associated with spontaneous insertion of substrates delivered to insertase- 
defective Get1/2 (solid line). One way in which the TMD-docking site might 
afford an integration path around the hydrophilic headgroup barrier is in the 
form of an aqueous pocket lined on one side by phospholipid hydrocarbon 
chains, and on the other by a composite surface of Getl and Get2 
transmembrane helices. 
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which the TMD has partitioned into the lipid bilayer (Fig. 4c). Thus, the 
Get1/2 insertase mechanism, stripped away of the complexities assoc- 
iated with insertion coupled to co-translational protein translocation, 
reveals the fundamental molecular logic of membrane protein targeting 
and insertion: a mechanism for reducing the ability ofa targeting factor 
to shield its substrates enables substrate handover to a binding site that 
is membrane-embedded within an insertion machine. 


METHODS SUMMARY 


Standard methods for monitoring Get1/2 function in vivo and in vitro, as well as 
Saccharomyces cerevisiae strains and plasmids used in this study, are described in 
detail in Methods. 

For protein engineering, Get1 was linked to Get2 using the sequence ASGAGG 
SEGGGSEGGTSGAT, previously used to link the subunits of a hexameric AAA* 
ATPase”’. S-protein attachment was achieved by mixing 2 il of the appropriate 
affinity-purified Get3 targeting complexes with 0.4 1] of S-protein (EMD Millipore) 
in 8 pl of insertion buffer (22 mM HEPES-KOH pH 7.4, 1.5 mM Mg(OAc)3, 120 mM 
KOAc, 2mM DTT, 14% glycerol, 0.75 mM ATP, 25 mM creatine phosphate and 
330 gml | creatine kinase) for 10 min at room temperature. In the experiment 
shown in Fig. 4a, the S-protein road-block was alleviated by the addition of syn- 
thetic S peptide (1 pg ul”') (KETAAAKFERQHMDS; Eton Bioscience). 

To release tail-anchored substrate from Get3, 2 ul of affinity-purified Get3 tar- 
geting complex was mixed with 1 il of microsomes (attenuance D290 nm = 80) in 
8 ul of insertion buffer with ATP regenerating system and incubated for 30 min at 
room temperature (23 °C). This was followed by crosslinking with 0.5 mM disuc- 
cinimidyl suberate (Pierce) at room temperature for 30 min. Crosslinking was 
quenched by incubation with 50 mM Tris-HCl, pH 7.4 for 5 min at room temper- 
ature. Before monitoring Sec22 release from Get3 by proteoliposomes (Fig. 2a), we 
added 0.3 jig of tail-anchored trap to 1 ul of proteoliposomes and incubated for 
20 min at room temperature. In one instance (Extended Data Fig. 1), membranes 
were substituted with miniGet1/2 and tail-anchored trap (0.1 pg pl ') or mock trap 
(0.1 pg pl"), as described previously’. 

For TMD docking to Get1/2, 2 ull of affinity-purified Get3 targeting complex was 
mixed with 1 pl of microsomes (D9 nm = 80) in 8 pl of insertion buffer at room 
temperature followed by crosslinking with 0.2 mM BMH (Pierce) on ice for 60 min. 
Crosslinking was quenched by incubation with 50 mM DTT for 15 min on ice. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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The transcriptome is the readout of the genome. Identifying common 
features in it across distant species can reveal fundamental principles. 
To this end, the ENCODE and modENCODE consortia have generated 
large amounts of matched RNA-sequencing data for human, worm 
and fly. Uniform processing and comprehensive annotation of these 
data allow comparison across metazoan phyla, extending beyond ear- 
lier within-phylum transcriptome comparisons and revealing ancient, 
conserved features’ °. Specifically, we discover co-expression modules 
shared across animals, many of which are enriched in developmental 
genes. Moreover, we use expression patterns to align the stages in worm 
and fly development and find a novel pairing between worm embryo 
and fly pupae, in addition to the embryo-to-embryo and larvae-to- 
larvae pairings. Furthermore, we find that the extent of non-canonical, 
non-coding transcription is similar in each organism, per base pair. 
Finally, we find in all three organisms that the gene-expression levels, 
both coding and non-coding, can be quantitatively predicted from 


chromatin features at the promoter using a ‘universal model’ based 
on a single set of organism-independent parameters. 

Our comparison used the ENCODE-modENCODE RNA resource 
(Extended Data Fig. 1). This resource comprises: deeply sequenced RNA- 
sequencing (RNA-seq) data from many distinct samples from all three 
organisms; comprehensive annotation of transcribed elements; and uni- 
formly processed, standardized analysis files, focusing on non-coding 
transcription and expression patterns. Where practical, these data sets 
match comparable samples across organisms and to other types of func- 
tional genomics data. In total, the resource contains 575 different exper- 
iments containing >67 billion sequence reads. It encompasses many 
different RNA types, including poly(A)+, poly(A)-, ribosomal-RNA- 
depleted, short and long RNA. 

The annotation in the resource represents a capstone for the decade- 
long efforts in human, worm and fly. The new annotation sets have 
numbers, sizes and families of protein-coding genes similar to previous 
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compilations; however, the number of pseudogenes and annotated non- 
coding RNAs differ (Extended Data Fig. 2, Extended Data Table 1 and 
Supplementary Fig. 1). Also, the number of splicing events is greatly 
increased, resulting in a concomitant increase in protein complexity. We 
find the proportion of the different types of alternative splicing (for exam- 
ple, exon skipping or intron retention) is generally similar across the three 
organisms; however, skipped exons predominate in human while retained 
introns are most common in worm and fly’ (Extended Data Fig. 3, Sup- 
plementary Fig. 1 and Supplementary Table 1). 

A fraction of the transcription comes from genomic regions not asso- 
ciated with standard annotations, representing ‘non-canonical transcrip- 
tion’ (Supplementary Table 2)*. Using a minimum-run—maximum-gap 
algorithm to process reads mapping outside of protein-coding tran- 
scripts, pseudogenes and annotated non-coding RNAs, we identified 
read clusters; that is, transcriptionally active regions (TARs). Across all 
three genomes we found roughly one-third of the bases gives rise to TARs 
or non-canonical transcription (Extended Data Table 1). To determine 
the extent that this transcription represents an expansion of the current 
established classes of non-coding RNAs, we identified the TARs most sim- 
ilar to known annotated non-coding RNAs using a supervised classifier? 
(Supplementary Fig. 2 and Supplementary Table 2). We validated the 
classifier’s predictions using RT-PCR (PCR with reverse transcription), 
demonstrating high accuracy. Overall, these predictions encompass only 
a small fraction of all TARs, suggesting that most TARs have features 
distinct from annotated non-coding RNAs and that the majority of 
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c Module expression in fly 


Figure 1 | Expression clustering. a, Left, human, 
worm and fly gene-gene co-association matrix; 
darker colouring reflects the increased likelihood 
that a pair of genes are assigned to the same 
module. A dark block along the diagonal represents 
a group of genes within a species. If this is 
associated with an off-diagonal block then it is a 
cross-species module (for example, a three-species 
conserved module is shown with a circle and a 
worm-fly module, with a star). However, if a 
diagonal block has no off-diagonal associations, 
then it forms a species-specific module (for 
example, green pentagon). Right, the Gene 
Ontology functional enrichment of genes within 
the 16 conserved modules is shown. GF, growth 
factor; nuc., nuclear; proc., processing. b, Primary 
and secondary alignments of worm-and-fly 
developmental stages based on all worm-fly 
orthologues. Inset shows worm-fly stage alignment 
using only hourglass orthologues is more 
significant and exhibits a gap (brown) matching 
the phylotypic stage. The scale for the heat map 
in b is indicated on the left side of the scale in 

a (labelled stage alignment). c, Normalized 
expression of the conserved modules in fly shows 
the smallest intra-organism divergence during 
the phylotypic stage (brown). A representative 
module is indicated with a blue asterisk in a 

and c. (For further details see Extended Data Figs 5 
and 6; ref. 20, related to the left part ofa; and ref. 21, 
related to the bottom part of b.) 
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non-coding RNAs of established classes have already been identified. 
To shed further light on the possible roles of TARs we intersected them 
with enhancers and HOT (high-occupancy target) regions*!* ", finding 
statistically significant overlaps (Extended Data Fig. 4 and Supplemen- 
tary Table 2). 

Given the uniformly processed nature of the data and annotations, 
we were able to make comparisons across organisms. First, we built co- 
expression modules, extending earlier analysis“ (Fig. 1a). To detect mod- 
ules consistently across the three species, we combined across-species 
orthology and within-species co-expression relationships. In the result- 
ing multilayer network we searched for dense subgraphs (modules), using 
simulated annealing’*’®. We found some modules dominated by a single 
species, whereas others contain genes from two or three. As expected, the 
modules with genes from multiple species are enriched in orthologues. 
Moreover, a phylogenetic analysis shows that the genes in such modules 
are more conserved across 56 diverse animal species (Extended Data 
Fig. 5 and Supplementary Fig. 3). To focus on the cross-species conserved 
functions, we restricted the clustering to orthologues, arriving at 16 con- 
served modules, which are enriched in a variety of functions, ranging 
from morphogenesis to chromatin remodelling (Fig. 1a and Supplemen- 
tary Table 3). Finally, we annotated many TARs based on correlating their 
expression profiles with these modules (Extended Data Fig. 4). 

Next, we used expression profiles of orthologous genes to align the 
developmental stages in worm and fly (Fig. 1b and Extended Data Fig. 6). 
For every developmental stage, we identified stage-associated genes; that 
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is, genes highly expressed at that particular stage but not across all stages. 
We then counted the number of orthologous pairs among these stage- 
associated genes for each possible worm-and-fly stage correspondence, 
aligning stages by the significance of the overlap. Notably, worm stages 
map to two sets of fly stages. First, they match in a co-linear fashion to the 
fly (that is, embryos-to-embryos, larvae-to-larvae). However, worm late 
embryonic stages also match fly pupal stages, suggesting a shared expres- 
sion program between embryogenesis and metamorphosis. The approx- 
imately 50 stage-associated genes involved in this dual alignment are 
enriched in functions such as ion transport and cation-channel activity 
(Supplementary Table 3). 

To gain further insight into the stage alignment, we examined our 16 
conserved modules in terms of the ‘hourglass hypothesis’, which posits 
that all animals go through a particular stage in embryonic development 
(the tight point of the hourglass or ‘phylotypic’ stage) during which the 
expression divergence across species for orthologous genes is smallest**””. 
For genes in 12 of the 16 modules, we observed canonical hourglass 
behaviour; that is, inter-organism expression divergence across closely 
related fly species during development is minimal’ (Supplementary Fig. 3). 
Moreover, we find a subset of TARs also exhibit this hourglass behav- 
iour (Supplementary Fig. 2). Beyond looking at inter-species divergence, 
we also investigated the intra-species divergence within just Drosophila 
melanogaster and Caenorhabditis elegans. Notably, we observed that 
divergence of gene expression between modules is minimized during 
the worm and fly phylotypic stages (Fig. 1c). This suggests, for an indi- 
vidual species, the expression patterns of different modules are most 
tightly coordinated (low divergence) during the phylotypic stage, but 
each module has its own expression signature before and after this. In 
fact it is possible to see this coordination directly as a local maximum 
in between-module correlations for the worm (Extended Data Fig. 5). 
Finally, using genes from just the 12 ‘hourglass modules’, we found that 
the alignment between worm and fly stages becomes stronger (Fig. 1b 
and Supplementary Fig. 3); in particular it shows a gap where no changes 
are observed, perfectly matching the phylotypic stage. 

The uniformly processed and matched nature of the transcriptome data 
also facilitates integration with upstream factor-binding and chromatin- 
modification signals. We investigated the degree to which these upstream 
signals can quantitatively predict gene expression and how consistent 
this prediction is across organisms. Similar to previous reports'’*", 
we found consistent correlations, around the transcription start site (TSS), 
in each of the three species between various histone-modification signals 
and the expression level of the downstream gene: H3K4mel, H3K4me2, 
H3K4me3 and H3K27ac are positively correlated, whereas H3K27me3 
is negatively correlated (Fig. 2, Extended Data Fig. 7 and Supplementary 
Fig. 4). Then for each organism, we integrated these individual correla- 
tions into a multivariate, statistical model, obtaining high accuracy in 
predicting expression for protein-coding genes and non-coding RNAs. 
The promoter-associated marks, H3K4me2 and H3K4me3, consistently 
have the highest contribution to the model. 

A similar statistical analysis with transcription factors showed the cor- 
relation between gene expression and transcription-factor binding to be 
the greatest at the TSS, positively for activators and negatively for repres- 
sors (Extended Data Fig. 7). Integrated transcription-factor models in 


0.74 0.73 0.51 correlations all significant, 
P<1X 10 ') of the organism- 
p84 : oes : sata specific and universal models. (See 
100 mRNAs ncRNAs Extended Data Figs 7 and 8 for 


further details.) 


Model accuracy 


each organism also achieved high accuracy for protein-coding genes 
and non-coding RNAs, with as few as five transcription factors neces- 
sary for accurate predictions (Extended Data Fig. 8). This perhaps reflects 
an intricate, correlated structure to regulation. The relative importance 
of the upstream regions is more peaked for the transcription-factor models 
than for the histone ones, likely reflecting the fact that histone modifi- 
cations are spread over broader regions, including the gene body, whereas 
most transcription factors bind near the promoter. 

Finally, we constructed a ‘universal model’, containing a single set of 
organism-independent parameters (Fig. 2 and Supplementary Fig. 4). 
This achieved accuracy comparable to the organism-specific models. 
In the universal model, the consistently important promoter-associated 
marks such as H3K4me2 and H3K4me3 are weighted most highly. In 
contrast, the enhancer mark H3K4mel is down-weighted, perhaps re- 
flecting that signals for most human enhancers are not near the TSS. 
Using the same set of organism-independent parameters derived from 
training on protein-coding genes, the universal model can also accur- 
ately predict non-coding RNA expression. 

Overall, our comparison of the transcriptomes of three phylogeneti- 
cally distant metazoans highlights fundamental features of transcription 
conserved across animal phyla. First, there are ancient co-expression 
modules across organisms, many of which are enriched for developmen- 
tally important hourglass genes. These conserved modules have highly 
coordinated intra-organism expression during the phylotypic stage, but 
display diversified expression before and after. The expression cluster- 
ing also aligns developmental stages between worm and fly, revealing 
shared expression programs between embryogenesis and metamor- 
phosis. Finally, we were able to build a single model that could predict 
transcription in all three organisms from upstream histone marks using 
a single set of parameters for both protein-coding genes and non-coding 
RNAs. Overall, our results underscore the importance of comparing 
divergent model organisms to human to highlight conserved biological 
principles (and disentangle them from lineage-specific adaptations). 


METHODS SUMMARY 


Detailed methods are given in the Supplementary Information. (See the first section 
of the Supplementary Information for a guide.) More details on data availability are 
given in section F of the Supplementary Information. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Genome function is dynamically regulated in part by chromatin, which 
consists of the histones, non-histone proteins and RNA molecules 
that package DNA. Studies in Caenorhabditis elegansand Drosophila 
melanogaster have contributed substantially to our understanding 
of molecular mechanisms of genome function in humans, and have 
revealed conservation of chromatin components and mechanisms’. 
Nevertheless, the three organisms have markedly different genome 
sizes, chromosome architecture and gene organization. On human 
and fly chromosomes, for example, pericentric heterochromatin flanks 
single centromeres, whereas worm chromosomes have dispersed het- 
erochromatin-like regions enriched in the distal chromosomal ‘arms’, 
and centromeres distributed along their lengths**. To systematically 
investigate chromatin organization and associated gene regulation 
across species, we generated and analysed a large collection of genome- 
wide chromatin data sets from cell lines and developmental stages in 
worm, fly and human. Here we present over 800 new data sets from 
our ENCODE and modENCODE consortia, bringing the total to over 
1,400. Comparison of combinatorial patterns of histone modifi- 
cations, nuclear lamina-associated domains, organization of large- 
scale topological domains, chromatin environment at promoters and 


enhancers, nucleosome positioning, and DNA replication patterns 
reveals many conserved features of chromatin organization among 
the three organisms. We also find notable differences in the compo- 
sition and locations of repressive chromatin. These data sets and ana- 
lyses provide a rich resource for comparative and species-specific 
investigations of chromatin composition, organization and function. 

We used chromatin immunoprecipitation followed by DNA sequenc- 
ing (ChIP-seq) or microarray hybridization (ChIP-chip) to generate 
profiles of core histones, histone variants, histone modifications and 
chromatin-associated proteins (Fig. 1, Supplementary Fig. 1 and Sup- 
plementary Tables 1 and 2). Additional data include DNase I hypersen- 
sitivity sites in fly and human cells, and nucleosome occupancy maps in 
all three organisms. Compared to our initial publications’ ”, this repre- 
sents a tripling of available fly and worm data sets and a substantial increase 
in human data sets (Fig. 1b, c). Uniform quality standards for experi- 
mental protocols, antibody validation and data processing were used 
throughout the projects’. Detailed analyses of related transcriptome 
and transcription factor data are presented in accompanying papers”. 

We performed systematic cross-species comparisons of chromatin 
composition and organization, focusing on targets profiled in at least two 


1Center for Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA. “Division of Genetics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, 
Boston, Massachusetts 02115, USA. °Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA. “Department of Biostatistics and Computational Biology, 
Dana-Farber Cancer Institute and Harvard School of Public Health, 450 Brookline Avenue, Boston, Massachusetts 02215, USA. 'Department of Biology and Carolina Center for Genome Sciences, University 
of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA. °Department of Information and Computer Engineering, Ajou University, Suwon 443-749, Korea. ’Systems Biomedical Informatics 
Research Center, College of Medicine, Seoul National University, Seoul 110-799, Korea. ®Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Lab, Berkeley, California 
94720, USA. °Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, USA. !°Department of Molecular Biology, Massachusetts General Hospital and 

Harvard Medical School, Boston, Massachusetts 02114, USA. '!The Gurdon Institute and Department of Genetics, University of Cambridge, Tennis Court Road, Cambridge CB2 1QN, UK. *National 


Institute of General Medical Sciences, National Institutes of Health, Bethesda, Maryland 20892, USA. 13National Human Genome Research Institute, National Institutes of Heal 
20892, USA. “Department of Biology, Washington University in St. Louis, St. Louis, Missouri 63130, USA. !°Computer Science and Artificial Inte 


, Bethesda, Maryland 
igence Laboratory, Massachusetts Institute of Technology, 


Cambridge, Massachusetts 02139, USA. 16Broad Institute, Cambridge, Massachusetts 02141, USA. Program in Bioinformatics, Boston University, Boston, Massachusetts 02215, USA. 18Department of 
Molecular, Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA. !°Department of Bioinformatics, School of Life Science and Technology, Tongji University, 


Shanghai 200092, China. 2°Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA. 21Departmen of Molecula 


it 


Biology and Biochemistry, Rutgers University, Piscataway, 


New Jersey 08854, USA. °*Food Science and Technology Department, Faculty of Agriculture, Alexandria University, 21545 El-Shatby, Alexandria, Egypt. °?Department of Pharmacology and Cancer Biology, 


Duke University Medical Center, Durham, 


orth Carolina 27710, USA. °Harvard/MIT Division of Health Sciences and Technology, Cambridge, Massachusetts 02139, USA. *°Department of Anatomy 


Physiology and Cell Biology, University of California Davis, Davis, California 95616, USA. 2°Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York 
10003, USA. ?’Princess Margaret Cancer Centre, Toronto, Ontario M6G 1L7, Canada. *®Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA. ?°Department of Molecular Biology, Cellular 
Biology and Biochemistry, Brown University, Providence, Rhode Island 02912, USA. °°Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA. 


31Department of Molecular Biology, Umea University, 901 87 Umea, Sweden. 32Seoul 


ational University Biomedical Informatics, Division of Biomedical Informatics, College of Medicine, Seoul National 


University, Seoul 110-799, Korea. °?Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts 02114, USA. °4Department of Genome Sciences, 
University of Washington, Seattle, Washington 98195, USA. 35Department of Chemistry, Boston University, Boston, Massachusetts 02215, USA. 3©\nformatics Program, Children’s Hospital, Boston, 


Massachusetts 02215, USA. +Present addresses: Victor Chang Cardiac Research Institute and The University of New South Wales, Sydney, 


ew South Wales 2052, Australia (J.W.K.H.); Department of 


Biochemistry, University at Buffalo, Buffalo, New York 14203, USA (T.L.); Department of Molecular Biology and Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 
08540, USA (K.I., T.E.J.); Department of Human Genetics, University of Chicago, Chicago, Illinois 06037, USA (J.D.L.); Division of Genomic Technologies, Center for Life Science Technologies, RIKEN, 
Yokohama 230-0045, Japan (A.M.); Department of Genetics, Department of Computer Science, Stanford University, Stanford, California 94305, USA (A.K.); Department of Biology, The University of 


Alabama at Birmingham, Birmingham, Alabama 35294, USA (N.C.R.). 
*These authors contributed equally to this work. 


28 AUGUST 2014 | VOL 512 | NATURE | 449 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Histone Non-Histone Others b 
H7-hESC B HE 
H1-hESC [i |e EH HHEHS i i | a 
K562 fl eeEn#o#oESs Bean fan | Human is 
; ree Peds ff i 
E ela - ‘ 
5 HepG2 [il os of 8G a ae : 
NHDF-Ad [i BH EH&S We 
HUVEC, NHEK [i EH — | Oo orn as 
Osteoblast Bf | | | Se et ot ee, 
an 7 me ™ 0 100 200 300 400 500 600 
Ke i nn ene oan Number of data sets 
Clone 8 
BG i il 
a ES14, ES10, ES5 mew 
ir r Early emnbirye ares | — a i+ | li modENCODE publications 2010 
ate embryo 14- * icati 
Third instar larvae (L3) FR Am | HEADED ue 
Adult head [ii i | 
Ovary, L3 sexed M or F c 
EWecweice) a |_| 4 ro 
Ixea embryo 
Late embryo a a se = 
oe eo See ee ry 
6 arvae stage 
= AD germline 
AD no embryos | Worn i 
AD germlineless | t T T T T T 
Le ti 1or2 
a ROSLERESURALOERELE EYP LLL MOP RTECS P8ES > ee ia 
& ChIP-seq rj LEE LOSES LPO SEE EEL ES POLE ILE CIP LLE SS SSS cS ES Number of factors or marks 
B ChIP-chip SASL OES SICSSERRE GRELG SSCS ES? F IES 
i Non-ChiP FRE FLEES TO ED OE SES #2 Bi Histone 
External source x 2 S RS a xe BN hist 
By RS & Re Ss jon-histone 
g $ S$ > B 


Figure 1 | Overview of the data set. a, Histone modifications, chromosomal 
proteins and other profiles mapped in at least two species (see Supplementary 
Fig. 1 for the full data set and Supplementary Table 1 for detailed 
descriptions). Different protein names for orthologues are separated by ‘/ 
(see Supplementary Table 2). b, The number of all data sets generated by this 


organisms (Fig. 1). Sample types used most extensively in our analyses 
are human cell lines H1-hESC, GM12878 and K562; fly late embryos, 
third instar larvae and cell lines $2, Kc, BG3; and worm early embryos 
and stage 3 larvae. Our conclusions are summarized in Extended Data 
Table 1. 

Not surprisingly, the three species show many common chromatin 
features. Most of the genome in each species is marked by at least one 
histone modification (Supplementary Fig. 2), and modification patterns 
are similar around promoters, gene bodies, enhancers and other chro- 
mosomal elements (Supplementary Figs 3 -12). Nucleosome occupancy 
patterns around protein-coding genes and enhancers are also largely sim- 
ilar across species, although we observed subtle differences in H3K4me3 
enrichment patterns around transcription start sites (TSSs) (Extended 
Data Fig. la and Supplementary Figs 12-14). The configuration and 
composition of large-scale features such as lamina-associated domains 
(LADs) are similar (Supplementary Figs 15 -17). LADs in human and 
fly are associated with late replication and H3K27me3 enrichment, sug- 
gesting a repressive chromatin environment (Supplementary Fig. 18). 
Finally, DNA structural features associated with nucleosome position- 
ing are strongly conserved (Supplementary Figs 19 and 20). 

Although patterns of histone modifications across active and silent 
genes are largely similar in all three species, there are some notable dif- 
ferences (Extended Data Fig. 1b). For example, H3K23ac is enriched at 
promoters of expressed genes in worm, but is enriched across gene bodies 
of both expressed and silent genes in fly. H4K20mel is enriched on both 
expressed and silent genes in human but only on expressed genes in fly 
and worm (Extended Data Fig. 1b). Enrichment of H3K36me3 in genes 
expressed with stage or tissue specificity is lower than in genes expressed 
broadly, possibly because profiling was carried out on mixed tissues (Sup- 
plementary Figs 21-23; see Supplementary Methods). Although the co- 
occurrence of pairs of histone modifications is largely similar across the 
three species, there are clearly some species-specific patterns (Extended 
Data Fig. 1c and Supplementary Figs 24 and 25). 

Previous studies showed that in human?” and fly’”’ prevalent com- 
binations of marks or ‘chromatin states’ correlate with functional features 
such as promoters, enhancers, transcribed regions, Polycomb-associated 
domains, and heterochromatin. ‘Chromatin state maps’ provide a con- 
cise and systematic annotation of the genome. To compare chromatin 
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and previous consortia publications’* (new, 815; old, 638). Each data set 
corresponds to a replicate-merged normalized profile of a histone, histone 
variant, histone modification, non-histone chromosomal protein, nucleosome 
or salt-fractionated nucleosome. c, The number of unique histone marks or 
non-histone chromosomal proteins profiled. 


states across the three organisms, we developed and applied a novel hier- 
archical non-parametric machine-learning method called hiHMM (see 
Supplementary Methods) to generate chromatin state maps from eight 
histone marks mapped in common, and compared the results with pub- 
lished methods (Fig. 2 and Supplementary Figs 26-28). We find that com- 
binatorial patterns of histone modifications are largely conserved. Based 
on correlations with functional elements (Supplementary Figs 29-32), 
we categorized the 16 states into six groups: promoter (state 1), enhan- 
cer (states 2 and 3), gene body (states 4-9), Polycomb-repressed (states 
10 and 11), heterochromatin (states 12 and 13), and weak or low signal 
(states 14-16). 

Heterochromatin is a classically defined and distinct chromosomal 
domain with important roles in genome organization, genome stability, 
chromosome inheritance and gene regulation. It is typically enriched 
for H3K9me3 (ref. 12), which we used as a proxy for identifying het- 
erochromatic domains (Fig. 3a and Supplementary Figs 33 and 34). As 
expected, the majority of the H3K9me3-enriched domains in human 
and fly are concentrated in the pericentromeric regions (as well as other 
specific domains, such as the Y chromosome and fly 4th chromosome), 
whereas in worm they are distributed throughout the distal chromosomal 
‘arms’'**"4 (Fig. 3a). In all three organisms, we find that more of the 
genome is associated with H3K9me3 in differentiated cells and tissues 
compared to embryonic cells and tissues (Extended Data Fig. 2a). We 
also observe large cell-type-specific blocks of H3K9me3 in human and 
fly'*’*> (Supplementary Fig. 35). These results suggest a molecular basis 
for the classical concept of ‘facultative heterochromatin’ formation to 
silence blocks of genes as cells specialize. 

Two distinct types of transcriptionally repressed chromatin have been 
described. As discussed above, classical ‘heterochromatin’ is generally 
concentrated in specific chromosomal regions and enriched for H3K9me3 
and also H3K9me2 (ref. 12). In contrast, Polycomb-associated silenced 
domains, involved in cell-type-specific silencing of developmentally 
regulated genes''™’, are scattered across the genome and enriched for 
H3K27me3. We found that the organization and composition of these 
two types of transcriptionally silent domains differ across species. First, 
human, fly and worm display significant differences in H3K9 methylation 
patterns. H3K9me2 shows a stronger correlation with H3K9me3 in fly 
than in worm (r = 0.89 versus r = 0.40, respectively), whereas H3K9me2 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Genomic coverage (%) 


State No. Human Fly Worm Human Fly Worm 
Promoter 17 0.6| 2 //4.3/3.9|| 4 | 24 
Enhancer1 2 2.1 | 2.9/3.4 | 3.4)/ 3.8 | 5.4 
Enhancer2 3 12:6) 4 ||83 | 83] 7.8 |10.5 
Transcription 5’1 4 [i 1.4] 24] 2 [3.8|[ 1.1 | 24 
Transcription 5°25 4.8 | 1.7 || 8 | 7.1]/ 0.5 | 2.8 
Gene, H4K20me1 6 3.5 | 7.6 || 7.9 | 6.9 || 4.1 | 5.9 
Transcription 3°17 2.1 | 4.1 || 4.4 | 5.8 || 7.4 | 8.4 
Transcription 3°2 8 6.5 | 1.3 || 3.9 | 2.6 || 2.4 | 3.3 
Transcription 3°39 2 | 64] 3.9 | 3.1 10.5) 7.3 
PC repressed 1 70 |] 19| 1 |[3.9 | 3.3] 1.9 | 0.8 
PC repressed 2 17 7.6 (| 5.3 | 5.3 || 3.7 | 6 
Heterochromatin 1 12 | | 3.7 | 2.5 || 5.8 | 5.5 || 4.9 | 4.5 
Heterochromatin 2 13 6.2 | 7.8 || 2.7 | 3.2 || 3.4 | 6.5 
Low signal 1 14 3.2 ie 
Low signal 2 15 [Sm aS 113.4 
Low signal 3 16 3.9 | 2 ||3.6 | 5.2]/ 2.6 | 2.1 
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Figure 2 | Shared and organism-specific chromatin states. Sixteen 
chromatin states derived by joint segmentation using hiHMM (see 
Supplementary Methods) based on enrichment patterns of eight histone 


is well correlated with H3K9mel in worm but not in fly (r = 0.44 versus 
r= —0.32, respectively) (Fig. 3b). These findings suggest potential dif- 
ferences in heterochromatin in the three organisms (see below). Second, 
the chromatin state maps reveal two distinct types of Polycomb-associated 
repressed regions: strong H3K27me3 accompanied by marks for active 
genes or enhancers (Fig. 2, state 10; perhaps due to mixed tissues in whole 
embryos or larvae for fly and worm), and strong H3K27me3 without 
active marks (state 11) (see also Supplementary Fig. 31). Third, we observe 
a worm-specific association of H3K9me3 and H3K27me3. These two 
marks are enriched together in states 12 and 13 in worm but not in human 
and fly. This unexpectedly strong association between H3K9me3 and 
H3K27me3 in worm (observed with several validated antibodies; Extended 
Data Fig. 2b) suggests a species-specific difference in the organization 
of silent chromatin. 

Wealso compared the patterns of histone modifications on expressed 
and silent genes in euchromatin and heterochromatin (Extended Data 
Fig. 2c and Supplementary Fig. 36). We previously reported prominent 
depletion of H3K9me3 at TSSs and high levels of H3K9me3 in the gene 
bodies of expressed genes located in fly heterochromatin”, and now find 
a similar pattern in human (Extended Data Fig. 2c and Supplementary 
Fig. 36). In these two species, H3K9me3 is highly enriched in the body of 
both expressed and silent genes in heterochromatic regions. In contrast, 
expressed genes in worm heterochromatin have lower H3K9me3 enrich- 
ment across gene bodies compared to silent genes (Extended Data Fig. 2c 
and Supplementary Figs 36, and 37). There are also conspicuous differ- 
ences in the patterns of H3K27me3 in the three organisms. In human 
and fly, H3K27me3 is highly associated with silent genes in euchromatic 
regions, but not with silent genes in heterochromatic regions. In contrast, 
consistent with the worm-specific association between H3K27me3 and 
H3K9me3, we observe high levels of H3K27me3 on silent genes in worm 
heterochromatin, whereas silent euchromatic genes show modest enrich- 
ment of H3K27me3 (Extended Data Fig. 2c and Supplementary Fig. 36). 

Our results suggest three distinct types of repressed chromatin (Extended 
Data Fig. 3). The first contains H3K27me3 with little or no H3K9me3 
(human and fly states 10 and 11, and worm state 11), corresponding to 
developmentally regulated Polycomb-silenced domains in human and 
fly, and probably in worm as well. The second is enriched for H3K9me3 
and lacks H3K27me3 (human and fly states 12 and 13), corresponding 
to constitutive, predominantly pericentric heterochromatin in human 
and fly, which is essentially absent from the worm genome. The third 
contains both H3K9me3 and H3K27me3 and occurs predominantly in 
worm (worm states 10, 12 and 13). Co-occurrence of these marks is con- 
sistent with the observation that H3K9me3 and H3K27me3 are both 
required for silencing of heterochromatic transgenes in worms'®. H3K9me3 
and H3K27me3 may reside on the same or adjacent nucleosomes in 
individual cells'””*; alternatively the two marks may occur in different 


> 


marks. The genomic coverage of each state in each cell type or developmental 
stage is also shown (see Supplementary Figs 26-32 for detailed analysis of the 
states). States are named for putative functional characteristics. 


cell types in the embryos and larvae analysed here. Further studies are 
needed to resolve this and determine the functional consequences of 
the overlapping distributions of H3K9me3 and H3K27me3 observed 
in worm. 
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Figure 3 | Genome-wide organization of heterochromatin. a, Enrichment 
profiles of H3K9mel, H3K9me2, H3K9me3 and H3K27me3, and identification 
of heterochromatin domains based on H3K9me3 (illustrated for human 
H1-hESC, fly L3 and worm L3). For fly chr 2, 2L, 2LHet, 2RHet and 2R 

are concatenated (dashed lines). C, centromere; Het, heterochromatin. 

b, Genome-wide correlation among H3K9mel, H3K9me2, H3K9me3, 
H3K27me3 and H3K36me3 in human K562 cells, fly L3 and worm L3; no 
H3K9me? profile is available for human. c, Comparison of Hi-C-based and 
chromatin-based topological domains in fly LE. Heat maps of similarity 
matrices for histone modification and Hi-C interaction frequencies are 
juxtaposed (see Supplementary Fig. 40). 
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Genome-wide chromatin conformation capture (Hi-C) assays have 
revealed prominent topological domains in human” and fly’. Although 
their boundaries are enriched for insulator elements and active genes’””” 
(Supplementary Fig. 38), the interiors generally contain a relatively uni- 
form chromatin state: active, Polycomb-repressed, heterochromatin, or 
low signal’* (Supplementary Fig. 39). We found that chromatin state sim- 
ilarity between neighbouring regions correlates with chromatin inter- 
action domains determined by Hi-C (Fig. 3c, Supplementary Fig. 40 and 
Supplementary Methods). This suggests that topological domains can be 
largely predicted by chromatin marks when Hi-C data are not available 
(Supplementary Figs 41 and 42). 

C. elegans and D. melanogaster have been used extensively for under- 
standing human gene function, development and disease. Our analyses 
of chromatin architecture and the large public resource we have gen- 
erated provide a blueprint for interpreting experimental results in these 
model systems, extending their relevance to human biology. They also 
provide a foundation for researchers to investigate how diverse genome 
functions are regulated in the context of chromatin structure. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Despite the large evolutionary distances between metazoan species, 
they can show remarkable commonalities in their biology, and this 
has helped to establish fly and worm as model organisms for human 
biology”. Although studies of individual elements and factors have 
explored similarities in gene regulation, a large-scale comparative 
analysis of basic principles of transcriptional regulatory features is 
lacking. Here we map the genome-wide binding locations of 165 human, 
93 worm and 52 fly transcription regulatory factors, generating a total 
of 1,019 data sets from diverse cell types, developmental stages, or 
conditions in the three species, of which 498 (48.9%) are presented 
here for the first time. We find that structural properties of regu- 
latory networks are remarkably conserved and that orthologous reg- 
ulatory factor families recognize similar binding motifs in vivo and 
show some similar co-associations. Our results suggest that gene- 
regulatory properties previously observed for individual factors are 
general principles of metazoan regulation that are remarkably well- 
preserved despite extensive functional divergence of individual net- 
work connections. The comparative maps of regulatory circuitry 
provided here will drive an improved understanding of the regula- 
tory underpinnings of model organism biology and how these relate 
to human biology, development and disease. 
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Figure 1 | A large compendium of regulatory binding across distant 
metazoans. Data from modENCODE and ENCODE consortia used in the 
analyses. Inner circles show the fraction of data sets presented here for the first 
time. For each organism the major contexts are shown in a different hue in 
the two outer circles. Asterisks, data sets that are not one of the main contexts. 
Each factor that underwent ChIP is shown in the middle circle and the count is 


Datasets: 219 
Factors: 93 
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Transcription regulatory factors guide the development and cellular 
activities ofall organisms through highly cooperative and dynamic control 
of gene expression programs. Regulatory factor coding genes are often 
conserved across deep phylogenies, their DNA-binding protein domains 
are preferentially conserved at the amino-acid level, and their in vitro 
binding specificities are also frequently conserved across large distances**. 
However, the specific DNA targets and binding partners of regulators 
can evolve much more rapidly than DNA-binding domains, making it 
unclear whether the in vivo binding properties of regulatory factors are 
conserved across large evolutionary distances. 

Comparisons of the locations of regulatory binding across species has 
been controversial, with some studies suggesting extensive conserva- 
tion’**"'°, whereas others suggest extensive turnover'’"*. Although it 
is generally assumed that across very large evolutionary distances reg- 
ulatory circuitry is largely diverged, there exist highly conserved sub- 
networks’*'’. Thus, confusion exists in the level of regulatory turnover 
between related species, possibly owing to the small number of factors 
studied. Moreover, despite recent observations of the architecture of 
metazoan regulatory networks a direct comparison of their topology and 
structure—such as clustered binding and regulatory network motif—has 
not been possible owing to large differences in the procedures employed 
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in parentheses (a factor can be represented in multiple contexts). The outer 

circle shows every data set, scaled by the number of peaks. Red, polymerase; 
light shades, transcription factor; dark shades, other. ChIP of a total of 165, 

93 and 52 unique factors across all conditions and cell lines in human, 

and developmental stages in worm and fly, respectively. 
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Figure 2 | HOT regions are dynamically established across stages and cell 
types. HOT regions contain binding sites for a large number of factors. a, A 
total of 2,948, 2,283, and 46,348 HOT regions exist, of which 29.1%, 13.7% 
and 9.7% are constitutive in worm, fly and human respectively. A large fraction 
of HOT regions are shared across multiple contexts but the majority of 

HOT regions are specific to a single context. b, Constitutive human HOT 
(cHOT) regions show strong enrichment for promoters while cell-type specific 
(GM12878 (GM), H1hesc (H1), HepG2 (HG), HelaS3 (HL), K562 (K5)) HOT 
regions show more enhancer enrichment (see also Extended Data Fig. 3). 
The cell type/context of the classes is indicated on top. Matched indicates that 
the classes are derived from the specific cell type analysed in each set. 


to assay regulatory factor binding in distinct species. Here we present a 
systematic and uniform comparison of regulation using many factors 
across distantly related species to help address these questions ona scale 
not previously possible. 

To compare regulatory architecture and binding across diverse organ- 
isms, the modENCODE and ENCODE consortia mapped the binding 
locations of 93 Caenorhabditis elegans regulatory factors, 52 Drosophila 
melanogaster regulatory factors and 165 human regulatory factors as a 
community resource (Fig. 1 and Supplementary Table 1). These regu- 
latory factor binding data sets represent a substantial increase over those 
previously published for worm (194 new data sets for a total of 219) and 


human (211 new, 707 total) and a substantial improvement in data 
quality in fly with a move from chromatin immunoprecipitation with 
DNA microarray (ChIP-chip) to ChIP followed by sequencing (ChIP- 
seq) (93 new, 93 total)**°”°. The majority of regulatory factors are site- 
specific transcription factors (83 in worm, 41 in fly, and 119 in human), 
although general regulatory factors such as RNA Pol II were also assayed. 

All regulatory factors were analysed by ChIP-seq according to mod- 
ENCODE/ENCODE standards: antibodies were extensively character- 
ized, and at least two independent biological replicates were analysed”". 
Worm regulatory factors were assayed in embryo and stage 1-4 larvae 
(L1-L4 larvae), fly regulatory factors in early embryo, late embryo and 
post embryo, and human regulatory factors in myelocytic leukaemia K562 
cells, lymphoblastoid GM12878 cells, H1 embryonic stem cells, cervical 
cancer HeLa cells, and liver eptihelium HepG2 cells. Binding sites were 
scored using a uniform pipeline that identifies reproducible targets using 
irreproducible discovery rate (IDR) analysis (Extended Data Fig. 1)” 
and quality-filtered experiments (see Methods). These rigorous quality 
metrics insure that the data sets used here are robust. All data presented are 
available at http://www.ENCODEProject.org/comparative/regulation/. 

To explore motif conservation, we examined the 31 cases in which 
we had members of orthologous transcription-factor families profiled 
in at least two species (Extended Data Fig. 2a and Methods). Sequence 
enriched motifs were found for 18 of the 31 families and for 12 ortho- 
logous families (41 regulatory factors), the same motif is enriched in 
both species (Extended Data Fig. 2b, c). For 18 of 31 families (64 of 93 
regulatory factors), the motif from one species is enriched in the bound 
regions of another species (one-sided hypergeometric, P = 3.3 X 10 *). 
These findings indicate that many factors retain highly similar in vivo 
sequence specificity within orthologous families, a feature noted previ- 
ously across studies working on smaller numbers of factors. 

Next, we used RNA-seq data’ to determine whether targets of ortho- 
logous regulatory factors are specifically expressed at similar develop- 
mental stages between fly and worm. As a class, orthologous regulatory 
factors (both assayed here and not) are significantly expressed at similar 
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stages (Extended Data Fig. 3a—c). However, expression of orthologous 
targets of orthologous regulatory factors in worm and fly shows little 
significant target overlap (Extended Data Fig. 3d) and the large majority 
of orthologous regulatory factors did not show conserved target functions 
(Extended Data Fig. 4a—c), suggesting extensive re-wiring of regulatory 
control across metazoans. Nevertheless, human and worm orthologous 
regulatory factors were more likely to show conserved target gene func- 
tions than non-orthologous regulatory factors (Extended Data Fig. 4d, 
Wilcoxon test P < 3.9 X 107°), highlighting regulatory factors with 
conserved target functions. 

Regulatory factor binding is not randomly distributed throughout the 
genome, but rather, in all three species, approximately 50% of binding 
events are found in highly-occupied clusters, termed high-occupancy 
target (HOT) regions'”**'°. HOT regions show enhancer function in 
integrated transcriptional reporters” and are stabilized by cohesin’*”. 
HOT regions show no significant enrichment with non-specific anti- 
bodies (Extended Data Fig. 5), in contrast to recent work using raw signal’? 
rather than IDR peaks, although the possibility that they are artefacts 
has been raised. 

By comparing HOT regions across different developmental times and 
cells types, we find that 5-10% of HOT regions are constitutive, indi- 
cating that HOT regions are dynamically established, rather than an 
intrinsic property of specific regions. In humans we find that approxi- 
mately 90% of constitutive HOT regions fall within promoter chromatin 
states compared to only approximately 10-20% of context-specific HOT 
regions (Fig. 2a and Extended Data Fig. 6). Instead, approximately 80- 
90% of context-specific HOT regions fall within enhancer states. More- 
over, these context-specific HOT regions are specifically enriched for 
enhancers in matching cell types or developmental stages. For exam- 
ple, 80% of GM12878-called HOT regions fall within GM12878-specific 
enhancers but only approximately 10% of GM12878-called HOT regions 
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fall within enhancers called in other cell-types (Fig. 2b). These patterns 
remain similar for all cell types (Extended Data Fig. 7), suggesting the 
two types of HOT regions are established concordantly and dynam- 
ically between cell types, though these patterns are weaker in the worm 
and fly data. 

We constructed regulatory networks in each species by predicting 
gene targets of each regulatory factor using TIP” and used simulated 
annealing to reveal the organization of regulatory factors in three layers 
of master-regulators, intermediate regulators, and low-level regulators 
(Fig. 3a, b). The algorithm found only 7% of regulatory factors at the 
top layer of the network in fly and 13% in worm, compared to 33% in 
human. We also found that more edges are upward flowing in human 
(30%) than worm and fly (22% and 7%). This suggests differences in 
the global network organization with more extensive feedback and a 
higher number of master regulators in human. 

We next assessed the local structure of regulatory networks, by search- 
ing for enriched sub-graphs known as network motifs (Fig. 3c). We 
found that the same network motifs were most and least enriched in the 
three species. In each case, the most abundant was the feed-forward loop 
(FFL), while the least abundant were cascade motifs, and both divergent 
and convergent regulation. Moreover, specific regulatory factors were 
enriched for origin, target, or intermediate regulators in these FFLs in 
each species (Fig. 3d). Surprisingly, the number of feed forward loops 
(FFLs) varied by developmental stage in both worm and fly, with L1 
stage in worm and late-embryo stage in fly showing the highest number 
of FFLs (Extended Data Fig. 8), suggesting increased filtering fluctua- 
tions and accelerating responses in these stages™*. 

We asked whether the three species showed conserved regulatory 
factor co-associations. We first focused on global co-associations where 
two factors co-associate frequently regardless of context, either by inter- 
molecular interactions or independent recruitment (Extended Data 
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Fig. 9). With the exception of a small number of conserved global reg- 
ulatory factor co-associations (for example, SIN3A with HDAC1, HDAC2 
and NR2C?2 in fly and human**”, and MXI1 with E2F1, E2F4 and E2F6 
in worm and human), the majority of global co-associations were not 
conserved in the contexts and species pairs analysed. 

As regulatory factor co-association at distinct binding regions is local 
and contextual (that is, different combinations of factors co-associate 
at different genomic locations), we next used an approach to detect co- 
association at distinct regions of the genome based on conserved pat- 
terns of regulatory factor binding. This method uses self-organizing maps 
(SOMs) to analyse co-association patterns at specific loci by better explor- 
ing the full combinatorial space of regulatory factor binding than tra- 
ditional co-association approaches (Fig. 4a—-c)**. We demonstrate that 
co-associations at distinct genomic regions reveal a more complex view 
of regulatory structure and bring forth categorical enrichments that are 
lost in a larger, genomic context. 

We examined whether specific contextual co-associations are con- 
served for orthologous regulatory factors by using binding data from 
each organismal pair; that is, human-worm and human-fly (Fig. 4b, g). 
Specific regulatory factor co-associations were observed; most are con- 
served to varying degrees across each organism with very few that are 
entirely organism-specific (Fig. 4b, g). These co-associations result in 
expected sets of factors such as the previously noted SIN3A + HDAC 
co-association. In addition, we find new co-associations such as the pat- 
tern in Fig. 4f for human-worm, which in worm is highly enriched for 
GO terms associated with sex determination. We further examined which 
co-associations are conserved at distinct gene locations (that is, proximal 
and distal). We found distinct combinations of conserved co-associations 
in relation to transcription start site (TSS) regions. Interestingly, virtu- 
ally all TSS-proximal co-associations in human remain TSS-proximal 
in worm (approximately 80%) and fly (approximately 100%), indicat- 
ing that co-associations that occur at promoters are often highly con- 
served (Fig. 4h). Conversely, co-associations at distal regions are much 
less conserved. 

Our results, obtained using a large resource of regulatory binding infor- 
mation, suggest that there is little conservation of individual regulatory 
targets and binding patterns for these highly divergent metazoans: C. 
elegans, D. melanogaster and H. sapiens. However, we do find strong con- 
servation of overall regulatory architecture, both in network motif usage 
and in concentrated regulatory binding at dynamically established HOT 
regions. We observe an increased conservation of in vivo sequence pref- 
erences and some target gene functions, with context-specific regulatory 
factor partners still observed at specific loci in these distal comparisons. 
These findings are consistent with previous results indicating that the 
gene targets of regulation are typically quite divergent and are likely to 
account for many of the phenotypic differences among species'?""*°7?, 
despite conserved sequence preferences. We significantly extend these 
observations, both in the number of regulators studied and in the range 
of regulatory properties studied, and provide specific examples of con- 
served and diverged regulatory functions. Lastly, beyond its potential for 
comparative studies of gene regulation, the primary data sets provide 
invaluable new information of genome-wide transcription-factor bind- 
ing information both in human, and in two of the most important meta- 
zoan models of human biology, development, and disease. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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COLUMN 
Bold moves 


People seeking non-academic jobs may need to try 
something unexpected to be noticed, says Peter Fiske. 


chimpanzees in Gombe, Tanzania, she 

observed one young male who devised an 
unusual means of raising his status. Dubbed 
Mike by Goodall’s researchers, the youngster 
had been bullied by higher-status males, rarely 
received portions of food from others and was 
locked out of any potential for procreation 
(H. Kummer and J. Goodall Phil. Trans. R. Soc. 
Lond. B 308, 203-214; 1985). 


lE Jane Goodall’s landmark study of 


But one day Mike discovered that when he 
accidentally knocked against the large metal 
kerosene cans lying around Goodall’s camp, the 
racket startled the other members of the chim- 
panzee group. One afternoon, Mike ran into 
the centre of the group, batting and banging on 
one of the cans and scattering all the chimpan- 
zees. Then he sat quietly in the centre of the 
clearing until the other chimpanzees slowly 
returned to gather around him. The females 


began grooming him, and the other males, 
even the largest alphas, gave him a wide berth. 

In one afternoon, Mike went from one of the 
lowest-status members of the group to one of 
the highest. He had adopted a strategy known 
in job-seeking circles as the bold move. 

A bold move is a calculated strategy to 
draw attention to yourself through a dra- 
matic and unusual act. It signals to a potential 
employer that you have creativity, drive and the 
willingness to take risks to achieve something 
important. When executed convincingly and 
with sincerity, the bold move can completely 
reset an employer's perceptions of your abili- 
ties and potential, because in the world outside 
academia, the bold move can get you ahead. 


TICK TOCK 

In a talk last year to PhD students at the 
University of California, Berkeley, author and 
lecturer Kristina Susac told of a bold-move 
strategy she had successfully used in a former 
sales post to nab a face-to-face sales meeting 
with an evasive prospect, the chief information 
officer of a large company. After many failed 
attempts to reach the executive through phone 
calls and e-mails, Susac realized that these 
methods were not going to produce a meeting. 

So she sent the executive a box by courier. 
Inside the box was a large alarm clock with a 
tag that read, “Isn't it time that we met?” Susac’s 
phone rang the next morning: it was the execu- 
tive. “OK, I give in — that was one of the most 
unusual and creative stunts that I have seen 
from any salesperson,” the officer told her. 
“Why dont you come in next week and we can 
discuss what you have to offer.” 

I told this story to a graduate student who 
was keenly interested in getting an intern- 
ship as a copywriter in an advertising firm. 
Although he was an excellent and creative 
writer and editor, he lacked the copywriting 
experience that was listed as a non-negotiable 
requirement in all the job advertisements 
that he had seen. So after a brief informa- 
tional interview with a member of the firm, 
he designed a ‘campaign’ T-shirt that bore an 
image of his face in profile, his name and the 
words “Running for Intern, 2014”. He sent the 
T-shirts to the firm's vice-president and head 
of creative products, and persuaded several 
other employees whom he had met during his 
interview to wear the T-shirts. He got the post. 

Early-career scientists often cringe when 
they first hear about the bold-move strategy. 
After spending years at the bottom of the aca- 
demic hierarchy, many young researchers > 


28 AUGUST 2014 | VOL 512 | NATURE | 457 


© 2014 Macmillan Publishers Limited. All rights reserved 


FUNDING 
Donation expectations 


US higher-education institutions reported 
that the value of charitable donations 

they received grew 5.2% for the 2013-14 
academic year, according to survey results 
from the Council for Advancement 

and Support of Education (CASE) in 
Washington DC. The institutions also 
predicted a growth of 5.7% for 2014-15. 
Community colleges, which offer basic 
tertiary education, reported a 7.2% 
growth in giving for 2013-14, compared 
with 5% for public and private universities 
that offer full, four-year degrees. The 
average yearly growth over 20 years is 
5.9%. CASE polls institutions twice a year 
on actual and predicted giving. 


TEACHING 


Mentoring programme 


The New York Academy of Sciences 

and Sister Cities International in 
Washington DC are collaborating on 

a mentoring programme in science, 
technology, engineering and maths 
(STEM) that will match graduate and 
undergraduate students with STEM 
professionals. The STEMentoring 
Program will draw mentors and students 
from Sister City volunteers around the 
world and match them through an online 
platform. Students will be able to connect 
with their counterparts and professionals 
globally through a virtual network, 

in which they can also participate in 
seminars and complete coursework. The 
programme will also foster collaboration 
in real-world STEM -related issues, such 
as water availability, in communities 
worldwide. 


JOB APPLICATIONS 
Lies have consequences 


Early-career researchers looking for 
positions in industry can probably wave 
goodbye to a job if they lie on their CVs, 
finds a survey of US hiring managers and 
human-resources executives. Conducted 
online from 13 May to 6 June by Harris 

Poll in Rochester, New York, on behalf of 
the jobs website CareerBuilder, the survey 
polled 2,188 respondents across various 
industries and company sizes. Some 51% of 
respondents said that they would dismiss 
an applicant on whose CV they found a lie, 
whereas 7% said that they would overlook 
it if they liked the candidate. Respondents 
said that 33% of detected untruths involved 
an applicant’s academic degree, and 57% 
were used to embellish skill sets. 
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Primatologist Jane Goodall observed a young chimpanzee making a bold move to earn social status. 


believe that their careers will progress only if 
they patiently work their way up the ladder, 
with advancement coming as the ‘silverbacks’ 
— people reaching retirement age — retreat. 
Sudden elevations in status are virtually 
unheard of in the slow-moving, heavily cir- 
cumscribed academic universe, even after 
publication of a world-class discovery. 

In the academic culture, graduate students 
learn that to qualify for positions in their 
disciplines, they must be widely recognized 
experts, and that the only positions that they 
should consider are those closely aligned 
with their areas of research. The academic 
establishment views 


self-promotion with “As an emp loyee, 
deep suspicion: the adaptability and 
work should speak willing: ness to 
for itself. If gradu- learnaremore 
ate students try to important to 
draw attention to cCareersuccess 
themselves or their thanis technical 


work, some fac- 
ulty members and 
department heads might see those efforts as 
a signal that the work is sub-par. 

Although these cultural norms may be 
de rigueur in academia, they leave early- 
career researchers maladapted to the job 
market outside it. Expertise is the foremost 
qualification for a job inside academia, but 
in the outside world, attitude is at least as 
important to potential employers. In aca- 
demia, a researcher can spend years focusing 
ona single problem or technical area. But in 
other, profit-driven sectors, employers know 
that priorities, opportunities and technologies 
change quickly. As an employee, adaptability 
and willingness to learn are more important 
to career success than is technical expertise. 

If a young PhD-holder is modest about 
his or her talents when applying for non- 
academic jobs, it can reinforce the perception 


expertise.” 


14 
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among potential employers that he or she is 
uncertain, hesitant about taking the initia- 
tive and poorly suited to any job other than 
academic research — for which, of course, 
few open positions exist. 

A bold move is, of course, inherently risky: 
a potential employer may view your attempt 
to set yourself apart as inappropriate. There 
is no way to know for certain whether your 
bold move is on target, but discussing it with 
a few friends or a mentor may help you to 
craft a gesture that is distinctive without 
being bizarre. As a science-trained PhD- 
holder, if you seek employment outside aca- 
demia — and greater economic reward for 
your years of investment in graduate school 
— you need to be able to retain the qualities 
of honesty and intellectual integrity while 
adapting to the cultural norms of the non- 
academic working world. Understand that 
your graduate-school experience has given 
you a broad set of transferable skills that, 
when combined with your intelligence and 
resourcefulness, will enable you to succeed 
in a wide range of jobs and roles. 

Some amount of self-advocacy is essential. 
If you do not assert what you believe you are 
capable of doing — and project a positive, 
confident attitude — employers will not take 
the risk of overlooking your lack of experi- 
ence. Sometimes it is important to make that 
bold move and to take what might feel like a 
risky step. Young PhD researchers tend to be 
risk-averse, but what may seem like a risky 
move to them is probably not that risky at all. 
The occasional bold move sends the signal to 
potential employers that you are more than 
just your PhD — and that you are ready to 
move up and beyond the rest of the pack. = 


Peter Fiske is chief executive of PAX Water 
Technologies in Richmond, California, and 
author of Put Your Science to Work. 
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THE ANGLE OF THE LIGHT ON THE 
BLOODSTAINED KITCHEN FLOOR 


BY MATT MIKALATOS 


octor Wallace told me that the 
D 22 November 1963 would be my 

assigned topic for the final exam. 
I’ve studied and crammed and memorized 
and watched holos and paid for memory 
boosts. But when I put on the 
Chronovisor, it doesn't show 
me Dallas, it shows me Tony 
Acerbi’s apartment. 

So Ido the research while 
my exam timer counts down. 
Dr Wallace has given me a 
two-week window of chronal 
study, culminating here on 
the 22nd. 

I learn that Tony Acerbi 
lives in a fetid hole of an 
apartment. He has way more 
cockroaches than friends. 
He wears slacks and a filthy 
undershirt most of the day, 
and lives on boosted mer- 
chandise and odd jobs. His 
home is in a Chicago neigh- 
bourhood where you call 
the cops and make bets on 
whether they'll show. 60-40 they don't. 

Tony’s neighbour is this old black man 
named Ollie Lewis. He’s tied with Tony for 
most hated man in the complex. Everyone's 
pretty sure he poisoned the barking dog in 
3A, and he definitely empties his ashtray into 
Mrs Fortuna’ flower box. 

At 3.42 p.m. on 22 November, two teen- 
agers break into Ollie’s place. 

Tony is sitting in his easy chair smoking 
a cigarette and he hears the muffled sounds 
of the old man next door getting worked 
over. He jumps up and, without putting on 
a jacket, or a shirt for that matter, he runs 
through the November cold and busts into 
Ollie’s apartment. 

Ollie is on the floor, curled in a ball. One 
kid has a Louisville Slugger, the other has 
steel-toed boots. Tony takes a couple of 
swings at them, but they’re not interested. 
They keep taking jabs at Ollie. So Tony lays 

down over Ollie, cov- 
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4 p.m. Friday 22 November 1963. 


of their blows are hitting Tony. There is 
blood everywhere. Tony, at a certain point, 
just turns his head to one side and stares at 
the floor, and the fading sunlight through 
the dirty window illuminates every mote of 
dust, like a thousand bursting stars drifting 
through his neighbour's kitchen. 


I can’t go much beyond this moment in the 
kitchen, and I can't see places other than these 
two apartments. I can't figure out why the 
teens are doing this, or why Tony is protecting 
Ollie. I don't see a reason for him to lay himself 
down like a shroud over his neighbour. 

I click my teeth in annoyance that 
Dr Wallace has done this to me, given me this 
particular moment on this day of all days. I 
have to wonder if Tony knows how inconse- 
quential this moment is. A couple of hours 
from now, JFK takes a bullet to the head. 
Aldous Huxley and C. S. Lewis both breathe 
their last. Hell, today The Beatles release their 
second album. Doctor Who has its series pre- 
miere less than 24 hours from this moment. 

Two men ina dilapidated apartment com- 
plex being beaten senseless by teenagers, 
it means less than nothing. You don’t get 
grants, you don’t get book deals, you don't 
get tenure by studying unimportant events. 

The two men breathe shallowly and twitch 
on the yellowed linoleum. Their breath stirs 
ripples in the growing puddle of blood. The 
teenagers leave them, leaving the door open. 

Two and a half minutes later, Mrs Fortuna 
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happens by to water her flowers and when she 
sees the men collapsed in Ollie’s kitchen, she 
screams and runs for her phone and calls the 
police, the fire department, the ambulances, 
anyone, anyone, please God, just send anyone. 

I probably couldn't even write a thesis 
about Tony Acerbi or Ollie Lewis. There 
won't be a holo. They’re 
nobodies and the resources 
for chronal viewing are tight. 
Dr Wallace picked an obscure 
moment because he likes his 
students to observe details, to 
see things fresh, and it’s hard 
to do that with the big histori- 
cal events. So he chose this 
little corner of nowhere as a 
test, not because he thinks it’s 
important. 

What happened to these 
two doesn’t matter. Not to 
my grade, not to my future, 
not to history or the fate of 
the country. 

None of it matters. The 
identity of the teenagers, the 
amount of blood lost, the 
pattern in the wallpaper, the 
reason for the assault, the brand of Tony’s 
cigarettes, the angle of the light that illumi- 
nates the two men gasping for breath on the 
kitchen floor. It doesn’t matter, not at all, not 
one bit. 

Out the broken doorway I can see the 
windows of a hundred other apartments, a 
hundred other stories that matter just as little 
as this one. 

Still. 

I find myself hoping for wailing sirens and 
blue and red lights. I find myself praying that 
someone will come. 

And just like that, my time's up. Dr Wallace 
removes my Chronovisor and places it on the 
next student in line. 

“What happened?” I ask. 

He turns, a weary pity in his eyes and says: 
“You passed.” = 


Matt Mikalatos lives near Portland, 
Oregon. He has a podcast called the 
StoryMen (www.storymen.us), a website 
called Norville Rogers (www.norvillerogers. 
com) and an inability to time travel in any 
direction but towards the future. 
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